SEQUENCE LISTING 

<110> Wang, Danher 

<12 0> GENETIC VACCINE AGAINST HUMAN IMMUNODEFICIENCY VIRUS 

<130> 22488-712 

<150> 09/585,599 

<151> 2000-06-02 

<150> PCT/US01/1823 8 

<151> 2001-06-04 

<160> 75 

<170> Patentln version 3.1 

<210> 1 

<211> 7 

<212> DNA 

<213> Ebola virus 

<400> 1 
ttttttt 



<210> 2 

<211> 7 

<212> RNA 

<213> Ebola virus 

<400> 2 
uuuuuuu 



<210> 3 

<211> 8 

<212> RNA 

<213> Artificial sequence 
<220> 

<223> Modified RNA editing site 

<400> 3 
uucuucuu 



<210> 4 

<211> 7 

<212> DNA 

<213> Ebola virus 

<400> 4 
aaaaaaa 



<210> 5 

<211> 8 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> DNA of modified RNA editing site 

<400> 5 
aagaagaa 



<210> 6 

<211> 6 

<212> RNA 

<213> Ebola virus 

<400> 6 

uuuuuu 6 



<400> 7 
uucuuc 



<210> 7 

<211> 6 

<212> RNA 

<213> Artificial sequence 
<220> 

H <223> Modified RNA editing site 

□ 

sF=r 

O <210> 8 

yj <2n> 6 

||1 <212> DNA 

J <213> Artificial sequence 

f7 <220> 

if <223> DNA of modified RNA editing site 

M <400> 8 

n ttcttc 



<210> 9 

<211> 21 

<212> PRT 

<213> Homo sapiens 



<400> 9 

Ser Trp Leu Leu Leu Leu Leu Leu Ser Leu Ser Leu Leu Gin Ala Thr 
1 5 10 15 



Asp Phe Met Ser Leu 
20 



<210> 10 

<211> 5 

<212> PRT 

<213> Ebola virus 



<400> 10 



Arg Arg Thr Arg Arg 
1 5 



<210> 11 

<211> 4 

<212> PRT 

<213> Human immunodeficiency virus type 1 

<400> 11 

Arg Glu Lys Arg 
1 



<210> 12 

<211> 9 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> DNA of GAG site 

<400> 12 
ggagctggt 



<210> 13 

<211> 3 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> GAG site 

<400> 13 

Gly Ala Gly 
1 



<210> 14 
<211> 3157 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Env/Tat/Rev 
<400> 14 

gaattctgca acaactgctg tttatccatt 
ggcgttactc gacagaggag agcaagaaat 
gaagcatcca ggaagtcagc ctaaaactgc 
ctttcattgc caagtttgtt tcataacaaa 
gcggagacag cgacgaagac ctcctcaagg 



ttcagaattg ggtgtcgaca tagcagaata 
ggagccagta gatcctagac tagagccctg 
ttgtaccaat tgctattgta aaaagtgttg 
agccttaggc atctcctatg gcaggaagaa 
cagtcagact catcaagttt ctctatcaaa 



gcagtaagta gtacatgtaa 
aataataata gcaatagttg 
acaaagaaaa atagacaggt 
tgagagtgaa ggagaaatat 
tccttgggat gttgatgatc 
gggtacctgt gtggaaggaa 
atgatacaga ggtacataat 
cacaagaagt agtattggta 
tagaacagat gcatgaggat 
aattaacccc actctgtgtt 
atagtagtag cgggagaatg 
tcagcacaag cataagaggt 
taataccaat agataatgat 
ttacacaggc ctgtccaaag 
ctggttttgc gattctaaaa 
atgtcagcac agtacaatgt 
taaatggcag tctggcagaa 
ctaaaaccat aatagtacag 
acaatacaag aaaaagtatc 
gaaaaatagg aaatatgaga 
ctttaaaaca gatagatagc 
ttaagcagtc ctcaggaggg 
aatttttcta ctgtaattca 
gtactaaagg gtcaaataac 
aacaaattat aaacatgtgg 
gacaaattag atgttcatca 
gcaacaatga gtccgagatc 
gtgaattata taaatataaa 
caaagagaag agtggtgcag 
ggttcttggg agcagcagga 



tgcaacctat acaaatagca 
tgtggtccat agtaatcata 
taattgatag actaatagaa 
cagcacttgt ggagatgggg 
tgtagtgcta cagaaaaatt 
gcaaccacca ctctattttg 
gtttgggcca cacatgcctg 
aatgtgacag aaaattttaa 
ataatcagtt tatgggatca 
agtttaaagt gcactgattt 
ataatggaga aaggagagat 
aaggtgcaga aagaatatgc 
actaccagct atacgttgac 
gtatcctttg agccaattcc 
tgtaataata agacgttcaa 
acacatggaa ttaggccagt 
gaagaggtag taattagatc 
ctgaaccaat ctgtagaaat 
cgtatccaga gaggaccagg 
caagcacatt gtaacattag 
aaattaagag aacaatttgg 
gacccagaaa ttgtaacgca 
acacaactgt ttaatagtac 
actgaaggaa gtgacacaat 
caggaagtag gaaaagcaat 
aatattacag ggctgctatt 
ttcagacctg gaggaggaga 
gtagtaaaaa ttgaaccatt 
agagaaaaaa gagcagtggg 
agcactatgg gcgcagcgtc 



atagtagcat tagtagtagc 
gaatatagga aaatattaag 
agagcagaag acagtggcaa 
gtggagatgg ggcaccatgc 
gtgggtcaca gtctattatg 
tgcatcagat gctaaagcat 
tgtacccaca gaccccaacc 
catgtggaaa aatgacatgg 
aagcctaaag ccatgtgtaa 
gaagaatgat actaatacca 
aaaaaactgc tctttcaata 
atttttttat aaacttgata 
aagttgtaac acctcagtca 
catacattat tgtgccccgg 
tggaacagga ccatgtacaa 
agtatcaact caactgctgt 
tgccaatttc acagacaatg 
taattgtaca agacccaaca 
gagagcattt gttacaatag 
tagagcaaaa tggaataaca 
aaataataaa acaataatct 
cagttttaat tgtggagggg 
ttggtttaat agtacttgga 
caccctccca tgcagaataa 
gtatgcccct cccatcagtg 
aacaagagat ggtggtaata 
tatgagggac aattggagaa 
aggagtagca cccaccaagg 
aataggagct ttgttccttg 
aatgacgctg acggtacagg 



m 



ccagacaatt 


attgtctggt 


ataqtqcaqc 


agcagaacaa 


tttgctgagg 


gctattgagg 


2160 


cgcaacagca 


tctgttgcaa 


ctcacagtct 


ggggcatcaa 


gcagctccag 


gcaagaatcc 


2220 


tqqctqtqqa 


aagataccta 


aaggatcaac 


agctcctggg gatttggggt 


tgctctggaa 


2280 


aactcatttg 


caccactgct 


qtqccttqqa 


atgctagttg 


gagtaataaa 


tctctggaac 


2340 


agatttggaa 


taacatgacc 


tqqatqqaqt 


gggacagaga 


aattaacaat 


tacacaagct 


2400 


taatacactc 


cttaattgaa 


gaatcgcaaa 


accagcaaga 


aaagaatgaa 


caagaattat 


2460 


tggaattaga 


taaatgggca 


agtttgtgga 


attggtttaa 


cataacaaat 


tggctgtggt 


2520 


atataaaatt 


attcataatg 


atagtaggag 


gcttggtagg 


tttaagaata 


gtttttgctg 


2580 


tactttctgt 


agtgaataga 


qttaqqcaqq 


gatattcacc 


attatcgttt 


cagacccacc 


2640 


tcccaatccc 


qaqqqqaccc 


qacaqqcccq 


aaggaataga 


agaagaaggt 


ggagagagag 


2700 


acagagacag 


atccattcga 


ttagtgaacg 


gatccttagc 


acttatctgg 


gacgatctgc 


2760 


ggagcctgtg 


cctcttcagc 


taccaccgct 


tgagagactt 


actcttgatt 


gtaacgagga 


2820 


ttcrtaoaact 

^— v*- v^^n ~j w* w 


tctqqqacqc 


aqqqqqtqqq 


aagccctcaa 


atattggtgg 


aatctcctac 


2880 


agtattggag 


tcaggagcta 


aagaatagtg 


ctgttagctt 


gctcaatgcc 


acagctatag 


2940 


cagtagctga 


ggggacagat 


agggttatag 


aagtagtaca 


aggagcttat 


agagctattc 


3000 


gccacatacc 


tagaagaata 


agacagggct 


tggaaaggat 


tttgctataa 


gatgggtggc 


3060 


aagtggtcaa 


aaagtagtgt 


ggttggatgg 


cctgctgtaa gggaaagaat 


gagacgagct 


3120 


gagccagcag 


cagatggggt 


gggagcagca 


tctcgag 






3157 


<210> 15 
<211> 508 
<212> DNA 

<213> Artificial sequence 










<220> 

<223> Modified IL-2 












<400> 15 
tcactctctt 


taatcactac 


tcacagtaac 


ctcaactcct 


gccacaatgt 


acaggatgca 


60 


actcctgtct 


tgcattgcac 


taagtcttgc 


acttgtcaca 


aacagtgcac 


ctacttcaag 


120 


ttctacaaag 


aaaacacagc 


tacaactgga 


gcatttactg 


ctggatttac 


agatgatttt 


180 


gaatggaatt 


aataattaca 


agaatcccaa 


actcaccagg 


atgctcacat 


ttaagtttta 


240 


catgcccaag 


aaggccacag 


aactgaaaca 


tcttcagtgt 


cttgaagaag 


aactcaaacc 


O A A 
J 0 U 


tctggaggaa gtgctaaatt 


tagctcaaag 


caaaaacttt 


cacttaagac 


ccagggactt 


360 


aatcagcaat 


atcaacgtaa 


tagttctgga 


actaaaggga 


tctgaaacaa 


cattcatgtg 


420 


tgaatatgct 


gatgagacag 


caaccattgt 


agaatttctg 


aacagatgga 


ttaccttttg 


480 





tcaaagcatc 


atctcaacac 


taacttga 




<210> 16 
<211> 2280 
<212> DNA 

<213> Artificial sequence 




<220> 

<223> Modified Env 






<400> 16 
gaattcgcca 


ccatgggagt 


gaaggagaaa 




tggggcacca 


tgctccttgg 


gatgttgatg 




acagtctatt 


atggggtacc 


tgtgtggaag 




gatgctaaag 


catatgatac 


agaggtacat 




acagacccca 


acccacaaga 


agtagtattg 




aaaaatgaca 


tggtagaaca 


gatgcatgag 


~ — if 


aagccatgtg 


taaaattaac 


cccactctgt 


yj 


gatactaata 


ccaatagtag 


tagcgggaga 




tgctctttca 


atatcagcac 


aagcataaga 


m 


tataaacttg 


atataatacc 


aatagataat 




aacacctcag 


tcattacaca 


ggcctgtcca 


o 

□ 


tattgtgccc 
ggaccatgta 
actcaactgc 


cggctggttt 
caaatgtcag 
tgttaaatgg 


tgcgattcta 
cacagtacaa 
cagtctggca 




ttcacacraca 


atgctaaaac 


cataatagta 




acaagaccca 


acaacaatac 


aagaaaaagt 




tttgttacaa 


taggaaaaat 


aggaaatatg 




aaatggaata 


acactttaaa 


acagatagat 




aaaacaataa 


tctttaagca 


gtcctcagga 




aattgtggag 


gggaattttt 


ctactgtaat 




aatagtactt 


ggagtactaa 


agggtcaaat 




ccatgcagaa 


taaaacaaat 


tataaacatg 




cctcccatca 


gtggacaaat 


tagatgttca 




gatggtggta 


atagcaacaa 


tgagtccgag 



508 



tatcagcact 


tgtggagatg 


ggggtggaga 


60 


atctgtagtg 


ctacagaaaa 


attgtgggtc 


120 


gaagcaacca 


ccactctatt 


ttgtgcatca 


180 


aatgtttggg 


ccacacatgc 


ctgtgtaccc 


240 


gtaaatgtga 


cagaaaattt 


taacatgtgg 


300 


gatataatca 


gtttatggga 


tcaaagccta 


360 


gttagtttaa 


agtgcactga 


tttgaagaat 


420 


atgataatgg 


agaaaggaga 


gataaaaaac 


480 


ggtaaggtgc 


agaaagaata tgcatttttt 


540 


gatactacca 


gctatacgtt 


gacaagttgt 


600 


aaggtatcct 


ttgagccaat 


tcccatacat 


660 


aaatgtaata 


ataagacgtt 


caatggaaca 


720 


tgtacacatg 


gaattaggcc 


agtagtatca 


780 


gaagaagagg 


tagtaattag 


atctgccaat 


840 


cagctgaacc 


aatctgtaga 


aattaattgt 


900 


atccgtatcc 


agagaggacc 


agggagagca 


960 


agacaagcac 


attgtaacat 


tagtagagca 


1020 


agcaaattaa 


gagaacaatt 


tggaaataat 


1080 


ggggacccag 


aaattgtaac 


gcacagtttt 


1140 


tcaacacaac 


tgtttaatag 


tacttggttt 


1200 


aacactgaag 


gaagtgacac 


aatcaccctc 


1260 


tggcaggaag 


taggaaaagc 


aatgtatgcc 


1320 


tcaaatatta 


cagggctgct 


attaacaaga 


1380 


atcttcagac 


ctggaggagg 


agatatgagg 


1440 



aacaattqqa 


qaaqtqaatt 


atataaatat 


gcacccacca 


aggcaaagag 


aaqaqtqqtq 


ttccttgggt 


tcttqqqaqc 


agcaggaagc 


gtacaggcca 


gacaattatt 


gtctggtata 


attgaggcgc 


aacagcatct 


gttgcaactc 


agaatcctgg 


ctqtqqaaaq 


atacctaaag 


tctggaaaac 


tcatttgcac 


cactgctgtg 


ctaaaacaaa 


tttggaataa 


catgacctgg 


acaagcttaa 


tacactcctt 


aattgaagaa 


aaattattaa 


aattagataa 


atgggcaagt 


ctgtggtata 


taaaattatt 


cataatgata 


tttgctgtac 


tttctgtagt 


gaatagagtt 


acccacctcc 


caatcccgag 


gggacccgac 


gagagagaca 


gagacagatc 


cattcgatta 



u 

Q <210> 17 

UJ <211> 1496 

|j1 <212> DNA 

B <213> Artificial sequence 

M 

: . <220> 

<223> Full length Gag 

r—s 

M 

M <400> 17 

p ggctagaagg agagaggatg ggtgcgagag 

gatgggaaaa aattcggtta aggccagggg 
tatgggcaag cagggagcta gaacgactac 
ttagatcatt atataataca gtagcaaccc 
aagacaccaa ggaagcttta gacaagatag 
cacagcaagc agcagctgac acaggacaca 
tgcagaacat ccaggggcaa atggtacatc 
gggtaaaagt agtagaagag aaggctttca 
tatcagaagg agccacccca caagatttaa 
aagcagccat gcaaatgtta aaagagacca 
tacatccagt gcatgcaggg cctattgcac 
acatagcagg aactactagt acccttcagg 



aaagtagtaa 


aaattgaacc 


attaggagta 


1500 


cagactagtg 


caqtqqgaat 

ZJ ZJ ZJ ZJ 


aggagctttg 


1560 


actatqqqcq 

ZJZJZJ ZJ 


cagcgtcaat 


gacgctgacg 


1620 


qtqcaqcaqc 


agaacaattt 


qctgagqgct 

ZJ 3 ZJ ZJ ZJ 


1680 


acaqtctgqg 

— J ^ ^ zj zj zj 


gcatcaagca 


gctccaggca 


1740 


gatcaacagc 


tcctggggat 


ttggggttgc 


1800 


ccttggaatg 


ctagttggag 
_* — > — > — ' 


taataaatct 


1860 


atqqaqtgqg 

-3 -3 ^ ZJ ZJ ZJ 


acagagaaat 


taacaattac 


1920 


tcgcaaaacc 


agcaagaaaa 


gaatgaacaa 


1980 


ttgtggaatt 


ggtttaacat 


aacaaattgg 


2040 


gtaggaggct 


tggtaggttt 


aagaatagtt 


2100 


aggcagggat 


attcaccatt 


atcgtttcag 


2160 


aggcccgaag 


gaatagaaga 


agaaggtgga 


2220 


gtgaacggat 


ccttagcact 


tatctggtaa 


2280 


cgtcagtatt 


aagcggggga 


gaattagatc 


60 


gaaagaaaaa 


atataaatta 


aaacatatag 


120 


aaccatccct 


tcagacagga 


tcagaagaac 


180 


tctattgtgt 


gcatcaaagg 


atagagataa 


240 


aggaagagca 


aaacaaaagt 


aagaaaaaag 


300 


qcaqtcaqqt 


cagccaaaat 


taccctatag 


360 


aggccatatc 


acctagaact 


ttaaatgcat 


420 


gcccagaagt 


aatacccatg 


ttttcagcat 


480 


acaccatgct 


aaacacagtg 


gggggacatc 


540 


tcaatgagga 


agctgcagaa 


tgggatagag 


600 


caggccagat 


gagagaacca 


aggggaagtg 


660 


aacaaatagg 


atggatgaca 


aataatccac 


720 



5 



Mi 



ctatcccagt 


aggagaaatt 


tataaaagat 


ggataatcct 


gggattaaat 


aaaatagtaa 


780 


gaatgtatag 


ccctaccagc 


attctggaca 


taagacaagg 


accaaaagaa 


ccttttagag 


840 


actatgtaga 


ccggttctat 


aaaactctaa 


gagccgagca 


agcttcacag 


gaggtaaaaa 


900 


attggatgac 


agaaaccttg 


ttggtccaaa 


atgcgaaccc 


agattgtaag 


actattttaa 


960 


aagcattggg 


accagcggct 


acactagaag 


aaatgatgac 


agcatgtcag 


ggagtaggag 


1020 


gacccggcca 


taaggcaaga 


gttttggctg 


aagcaatgag 


ccaagtaaca 


aatacagcta 


1080 


ccataatgat 


gcagagaggc 


aattttagga 


accaaagaaa 


gatggttaag 


tgtttcaatt 


1140 


gtggcaaaga 


agggcacaca 


gccagaaatt 


gcagggcccc 


taggaaaaag 


ggctgttgga 


1200 


aatgtggaaa 


ggaaggacac 


caaatgaaag 


attgtactga 


gagacaggct 


aattttttag 


1260 


ggaagatctg 


gccttcctac 


aagggaaggc 


cagggaattt 


tcttcagagc 


agaccagagc 


1320 


caacagcccc 


accatttctt 


cagagcagac 


cagagccaac 


agccccacca 


gaagagagct 


1380 


tcaggtctgg 


ggtagagaca 


acaactcccc 


ctcagaagca 


ggagccgata 


gacaaggaac 


1440 


tgtatccttt 


aacttccctc 


agatcactct 


ttggcaacga 


cccctcgtca 


caataa 


1496 



g 
o 

W <210> 18 

□ <211> 492 

yd <212> PRT 

\J\ <213> Human immunodeficiency virus type 1 



<400> 18 



^ Met Gly Ala Arg Ala Ser Val Leu Ser Gly Gly Glu Leu Asp Arg Trp 

pi 5 10 15 



Glu Lys lie Arg Leu Arg Pro Gly Gly Lys Lys Lys Tyr Lys Leu Lys 
20 25 30 



His lie Val Trp Ala Ser Arg Glu Leu Glu Arg Leu Gin Pro Ser Leu 
35 40 45 



Gin Thr Gly Ser Glu Glu Leu Arg Ser Leu Tyr Asn Thr Val Ala Thr 
50 * 55 60 



Leu Tyr Cys Val His Gin Arg lie Glu lie Lys Asp Thr Lys Glu Ala 
65 " 70 ~ 75 80 



Leu Asp Lys lie Glu Glu Glu Gin Asn Lys Ser Lys Lys Lys Ala Gin 
85 90 95 



Gin Ala Ala Ala Asp Thr Gly His Ser Ser Gin Val Ser Gin Asn Tyr 
100 105 110 



Pro He Val Gin Asn He Gin Gly Gin Met Val His Gin Ala He Ser 
115 120 125 



Pro Arg Thr Leu Asn Ala Trp Val Lys Val Val Glu Glu Lys Ala Phe 
130 135 140 



Ser Pro Glu Val He Pro Met Phe Ser Ala Leu Ser Glu Gly Ala Thr 
145 150 155 160 



Pro Gin Asp Leu Asn Thr Met Leu Asn Thr Val Gly Gly His Gin Ala 
165 170 175 



Ala Met Gin Met Leu Lys Glu Thr He Asn Glu Glu Ala Ala Glu Trp 
180 185 190 



Asp Arg Val His Pro Val His Ala Gly Pro He Ala Pro Gly Gin Met 
195 200 205 



Arg Glu Pro Arg Gly Ser Asp He Ala Gly Thr Thr Ser Thr Leu Gin 
210 215 220 



Glu Gin He Gly Trp Met Thr Asn Asn Pro Pro He Pro Val Gly Glu 
225 230 235 240 



He Tyr Lys Arg Trp He He Leu Gly Leu Asn Lys He Val Arg Met 
245 250 255 



Tyr Ser Pro Thr Ser He Leu Asp He Arg Gin Gly Pro Lys Glu Pro 
260 265 270 



Phe Arg Asp Tyr Val Asp Arg Phe Tyr Lys Thr Leu Arg Ala Glu Gin 
275 "* 280 285 



Ala Ser Gin Glu Val Lys Asn Trp Met Thr Glu Thr Leu Leu Val Gin 
290 295 300 



Asn Ala Asn Pro Asp Cys Lys Thr He Leu Lys Ala Leu Gly Pro Ala 
305 310 315 320 



Ala Thr Leu Glu Glu Met Met Thr Ala Cys Gin Gly Val Gly Gly Pro 
325 330 335 



Gly His Lys Ala Arg Val Leu Ala Glu Ala Met Ser Gin Val Thr Asn 
340 345 350 



Thr Ala Thr lie Met Met Gin Arg Gly Asn Phe Arg Asn Gin Arg Lys 
355 360 365 



□ 



Met Val Lys Cys Phe Asn Cys Gly Lys Glu Gly His Thr Ala Arg Asn 
370 ~ 375 380 



Cys Arg Ala Pro Arg Lys Lys Gly Cys Trp Lys Cys Gly Lys Glu Gly 
385 390 395 400 



His Gin Met Lys Asp Cys Thr Glu Arg Gin Ala Asn Phe Leu Gly Lys 
405 410 415 



lie Trp Pro Ser Tyr Lys Gly Arg Pro Gly Asn Phe Leu Gin Ser Arg 
420 425 430 



Pro Glu Pro Thr Ala Pro Pro Phe Leu Gin Ser Arg Pro Glu Pro Thr 
435 440 445 



Ala Pro Pro Glu Glu Ser. Phe Arg Ser Gly Val Glu Thr Thr Thr Pro 
450 455 460 



s 

O Pro Gin Lys Gin Glu Pro lie Asp Lys Glu Leu Tyr Pro Leu Thr Ser 

LU 465 470 475 480 

m 

* ± Leu Arg Ser Leu Phe Gly Asn Asp Pro Ser Ser Gin 

f 53 * 485 490 

s a 

r 3 



<210> 
<211> 
<212> 
<213> 


19 

2941 
DNA 

Artificial sequence 










<220> 
<223> 


Modified Env from HIV strain pNL4-3 








<400> 19 
gaattctgca 


acaactgctg 


tttatccatt 


tcagaattgg gtgtcgacat 


agcagaatag 


60 


gcgttactcg 


acagaggaga 


gcaagaaatg 


gagccagtag atcctagact 


agagccctgg 


120 


aagcatccag 


gaagtcagcc 


taaaactgct 


tgtaccaatt 


gctattgtaa 


aaagtgttgc 


180 


tttcattgcc 


aagtttgttt 


catgacaaaa 


gccttaggca 


tctcctatgg 


caggaagaag 


240 


cggagacagc 


gacgaagagc 


tcatcagaac 


agtcagactc 


atcaagcttc 


tctatcaaag 


300 


cagtaagtag 


tacatgtaat 


gcaacctata 


atagtagcaa 


tagtagcatt 


agtagtagca 


360 


ataataatag 


caatagttgt 


gtggtccata 


gtaatcatag 


aatataggaa 


aatattaaga 


420 


caaagaaaaa 


tagacaggtt 


aattgataga 


ctaatagaaa 


gagcagaaga 


cagtggcaat 


480 



y 
o 

y, 



gagagtgaag 


gagaagtatc 


agcacttgtg 


gagatggggg 


tggaaatggg 


gcaccatgct 


540 


ccttgggata 


ttgatgatct 


gtagtgctac 


agaaaaattg 


tgggtcacag 


tctattatgg 


600 


ggtacctgtg 


tggaaggaag 


caaccaccac 


tctattttgt 


gcatcagatg 


ctaaagcata 


660 


tgatacagag 


gtacataatg 


tttgggccac 


acatgcctgt 


gtacccacag 


accccaaccc 


720 


acaagaagta 


gtattggtaa 


atgtgacaga 


aaattttaac 


atgtggaaaa 


atgacatggt 


780 


agaacagatg 


catgaggata 


taatcagttt 


atgggatcaa 


agcctaaagc 


catgtgtaaa 


840 


attaacccca 


ctctgtgtta 


gtttaaagtg 


cactgatttg 


aagaatgata 


ctaataccaa 


900 


tagtagtagc 


gggagaatga 


taatggagaa 


aggagagata 


aaaaactgct 


ctttcaatat 


960 


cagcacaagc 


ataagagata 


aggtgcagaa 


agaatatgca 


ttcttttata 


aacttgatat 


1020 


agtaccaata 


gataatacca 


gctataggtt 


gataagttgt 


aacacctcag 


tcattacaca 


1080 


ggcctgtcca 


aaggtatcct 


ttgagccaat 


tcccatacat 


tattgtgccc 


cggctggttt 


1140 


tgcgattcta 


aaatgtaata 


ataagacgtt 


caatggaaca 


ggaccatgta 


caaatgtcag 


1200 


cacagtacaa 


tgtacacatg 


gaatcaggcc 


agtagtatca 


actcaactgc 


,tgttaaatgg 


1260 


cagtctagca 


gaagaagatg 


tagtaattag 


atctgccaat 


ttcacagaca 


atgctaaaac 


1320 


cataatagta 


cagctgaaca 


catctgtaga 


aattaattgt 


acaagaccca 


acaacaatac 


1380 


aagaaaaagt 


atccgtatcc 


agaggggacc 


agggagagca 


tttgttacaa 


taggaaaaat 


1440 


aggaaatatg 


agacaagcac 


attgtaacat 


tagtagagca 


aaatggaatg 


ccactttaaa 


1500 


acagatagct 


agcaaattaa 


gagaacaatt 


tggaaataat 


aaaacaataa 


tctttaagca 


1560 


atcctcagga 


ggggacccag 


aaattgtaac 


gcacagtttt 


aattgtggag 


gggaattttt 


1620 


ctactgtaat 


tcaacacaac 


tgtttaatag 


tacttggttt 


aatagtactt 


ggagtactga 


1680 


agggtcaaat 


aacactgaag 


gaagtgacac 


aatcacactc 


ccatgcagaa 


taaaacaatt 


1740 


tataaacatg 


tggcaggaag 


taggaaaagc 


aatgtatgcc 


cctcccatca 


gtggacaaat 


1800 


tagatgttca 


tcaaatatta 


ctgggctgct 


attaacaaga 


gatggtggta 


ataacaacaa 


1860 


tgggtccgag 


atcttcagac 


ctggaggagg 


cgatatgagg 


gacaattgga 


gaagtgaatt 


1920 


atataaatat 


aaagtagtaa 


aaattgaacc 


attaggagta 


gcacccacca 


aggcaaagag 


1980 


aagagtggtg 


cagactagtg 


cagtgggaat 


aggagctttg 


ttccttgggt 


tcttgggagc 


2040 


agcaggaagc 


actatgggct 


gcacgtcaat 


gacgctgacg 


gtacaggcca 


gacaattatt 


2100 


gtctgatata 


gtgcagcagc 


agaacaattt 


gctgagggct 


attgaggcgc 


aacagcatct 


2160 


gttgcaactc 


acagtctggg 


gcatcaaaca 


gctccaggca 


agaatcctgg 


ctgtggaaag 


2220 


atacctaaag 


gatcaacagc 


tcctggggat 


ttggggttgc 


tctggaaaac 


tcatttgcac 


2280 



r 





cactactQtQ 


ccttggaatg 


ctagttggag 


taataaatct 


ctggaacaga 


tttggaataa 


2340 




catgacctgg 


atqgaqtqqq 

ZJ ZJ ZJ ZJ ZJ ZJ 


acagagaaat 


taacaattac 


acaagcttaa 


tacactcctt 


2400 




aattgaagaa 


tcgcaaaacc 


agcaagaaaa 


gaatgaacaa 


gaattattgg 


aattagataa 


2460 




atgggcaagt 


ttgtggaatt 


ggtttaacat 


aacaaattgg 


ctgtggtata 


taaaattatt 


2520 




cataatgata 


qtagqaqgct 

ZJ ZJ ZJ ZJ Zi 


tggtaggttt 


aagaatagtt 


tttgctgtac 


tttctatagt 


2580 




qaataqaqtt 

j **** w 3 -3 


aqqcaqqgat 

ZJZJ ZjZjZj 


attcaccatt 


atcgtttcag 


acccacctcc 


caatcccgag 


2640 




qqqacccqac 


aggcccgaag 


gaatagaaga 


agaaggtgga gagagagaca gagacagatc 


2700 




cattcaatta 


otaaacqqat 


ccttagcact 


tatctgggac gatctgcgga gcctgtgcct 


2760 




cttcagctac 


caccgcttga 


gagacttact 


cttgattgta 


acgaggattg 


tggaacttct 


2820 




gggacgcagg 


gggtgggaag 


ccctcaaata 


ttggtggaat 


ctcctacagt 


attggagtca 


2880 




ggaactaaag 


aatagtgctg 


ttaacttgct 


caatgccaca 


gccatagcag 


tagctgagta 


2940 


□EOOOT 


a 

<210> 20 
<211> 2746 
<212> DNA 

<213> Artificial sequence 








2941 


£ « ; 

in 


<220> 

<223> Modified Env/Tat/Rev from 


pNL4-3 








U 


<400> 20 
gaattctgca 


acaactgctg 


tttatccatt 


tcagaattgg gtgtcgacat 


agcagaatag 


60 


0 


gcgttactcg 


acagaggaga 


gcaagaaatg 


gagccagtag 


atcctagact 


agagccctgg 


120 




aagcatccag 


gaagtcagcc 


taaaactgct 


tgtaccaatt 


gctattgtaa 


aaagtgttgc 


180 




tttcattgcc 


aagtttgttt 


catgacaaaa 


gccttaggca 


tctcctatgg 


caggaagaag 


240 




cggagacagc 


gacgaagagc 


tcatcagaac 


agtcagactc 


atcaagcttc 


tctatcaaag 


300 




cagtaagtag 


tacatgtaat 


gcaacctata 


atagtagcaa 


tagtagcatt 


agtagtagca 


360 




ataataatag 


caatagttgt 


gtggtccata 


gtaatcatag 


aatataggaa 


aatattaaga 


420 




caaagaaaaa 


tagacaggtt 


aattgataga 


ctaatagaaa gagcagaaga cagtggcaat 


480 




gagagtgaag 


gagaagtatc 


agcacttgtg 


gagatggggg 


tggaaatggg gcaccatgct 


540 




ccttgggata 


ttgatgatct 


gtagtgctac 


agaaaaattg 


tgggtcacag 


tctattatgg 


600 




ggtacctgtg 


tggaaggaag 


caaccaccac 


tctattttgt 


gcatcagatg 


ctaaagcata 


660 




tgatacagag 


gtacataatg 


tttgggccac 


acatgcctgt 


gtacccacag 


accccaaccc 


720 




acaagaagta 


gtattggtaa 


atgtgacaga 


aaattttaac 


atgtggaaaa atgacatggt 


780 




agaacagatg 


catgaggata 


taatcagttt 


atgggatcaa 


agcctaaagc 


catgtgtaaa 


840 



•- s 



ye 



b 



attaacccca 


ctctgtgtta 


gttgtaacac 


ctcagtcatt 


acacaggcct 


gtccaaaggt 


900 


atcctttgag 


ccaattccca 


tacattattg 


tgccccggct 


ggttttgcga 


ttctaaaatg 


960 


taataataag 


acgttcaatg 


gaacaggacc 


atgtacaaat 


gtcagcacag 


tacaatgtac 


1020 


acatggaatc 


aggccagtag 


tatcaactca 


actgctgtta 


aatggcagtc 


tagcagaaga 


1080 


agatgtagta 


attagatctg 


ccaatttcac 


agacaatgct 


aaaaccataa 


tagtacagct 


1140 


gaacacatct 


gtagaaatta 


attgtacaag 


acccaacaac 


aatacaagaa 


aaagtatccg 


1200 


tatccagagg 


ggaccaggga 


gagcatttgt 


tacaatagga 


aaaataggaa 


atatgagaca 


1260 


agcacattgt 


aacattagta 


gagcaaaatg 


gaatgccact 


ttaaaacaga 


tagctagcaa 


1320 


attaagagaa 


caatttggaa 


ataataaaac 


aataatcttt 


aagcaatcct 


caggagggga 


1380 


cccagaaatt 


gtaacgcaca 


gttttaattg 


tggaggggaa 


tttttctact 


gtaattcaac 


1440 


acaactgttt 


aatagtactt 


ggtttaatag 


tacttggagt 


actgaagggt 


caaataacac 


1500 


tgaaggaagt 


gacacaatca 


cactcccatg 


cagaataaaa 


caatttataa 


acatgtggca 


1560 


ggaagtagga 


aaagcaatgt 


atgcccctcc 


catcagtgga 


caaattagat 


gttcatcaaa 


1620 


tattactggg 


ctgctattaa 


caagagatgg 


tggtaataac 


aacaatgggt 


ccgagatctt 


1680 


cagacctgga 


ggaggcgata 


tgagggacaa 


ttggagaagt 


gaattatata 


aatataaagt 


1740 


agtaaaaatt 


gaaccattag 


gagtagcacc 


caccaaggca 


aagagaagag 


tggtgcagac 


1800 


tagtgcagtg 


ggaataggag 


ctttgttcct 


tgggttcttg 


ggagcagcag 


gaagcactat 


1860 


gggctgcacg 


tcaatgacgc 


tgacggtaca 


ggccagacaa 


ttattgtctg 


atatagtgca 


1920 


gcagcagaac 


aatttgctga 


gggctattga 


ggcgcaacag 


catctgttgc 


aactcacagt 


1980 


ctggggcatc 


aaacagctcc 


aggcaagaat 


cctggctgtg 


gaaagatacc 


taaaggatca 


2040 


acagctcctg 


gggatttggg 


gttgctctgg 


aaaactcatt 


tgcaccactg 


ctgtgccttg 


2100 


gaatgctagt 


tggagtaata 


aatctctgga 


acagatttgg 


aataacatga 


cctggatgga 


2160 


gtgggacaga 


gaaattaaca 


attacacaag 


cttaatacac 


tccttaattg 


aagaatcgca 


2220 


aaaccagcaa 


gaaaagaatg 


aacaagaatt 


attggaatta 


gataaatggg 


caagtttgtg 


2280 


gaattggttt 


aacataacaa 


attggctgtg 


gtatataaaa 


ttattcataa 


tgatagtagg 


2340 


aggcttggta 


ggtttaagaa 


tagtttttgc 


tgtactttct 


atagtgaata 


gagttaggca 


24 00 


gggatattca 


ccattatcgt 


ttcagaccca 


cctcccaatc 


ccgaggggac 


ccgacaggcc 


2460 


cgaaggaata 


gaagaagaag 


gtggagagag 


agacagagac 


agatccattc 


gattagtgaa 


2520 


cggatcctta 


gcacttatct 


gggacgatct 


gcggagcctg 


tgcctcttca 


gctaccaccg 


2580 


cttgagagac 


ttactcttga 


ttgtaacgag 


gattgtggaa 


cttctgggac 


gcagggggtg 


2640 





ggaagccctc 


aaatattggt 


ggaatctcct 


acagtattgg 


agtcaggaac 


taaagaatag 


2700 




tgctgttaac 


ttgctcaatg 


ccacagccat 


agcagtagct 


gagtaa 




2746 




<210> 21 
<211> 3417 
<212> DNA 

<213> Artificial sequence 












<220> 

<223> Modified Env/Tat/Rev/Nef from strain 


BH10 








<400> 21 
gaattctgca 


acaactgctg 


tttatccatt 


ttcagaattg 


ggtgtcgaca 


tagcagaata 


60 




ggcgttactc 


gacagaggag 


agcaagaaat 


ggagccagta 


gatcctagac 


tagagccctg 


120 




gaagcatcca 


ggaagtcagc 


ctaaaactgc 


ttgtaccaat 


tgctattgta 


aaaagtgttg 


180 




ctttcattgc 


caagtttgtt 


tcataacaaa 


agccttaggc 


atctcctatg 


gcaggaagaa 


240 




gcggagacag 


cgacgaagac 


ctcctcaagg 


cagtcagact 


catcaagttt 


ctctatcaaa 


300 


Sees 


gcagtaagta 


gtacatgtaa 


tgcaacctat 


acaaatagca 


atagtagcat 


tagtagtagc 


360 


s 
o 


aataataata 


gcaatagttg 


tgtggtccat 


agtaatcata 


gaatatagga 


aaatattaag 


420 


y 


acaaagaaaa 


atagacaggt 


taattgatag 


actaatagaa 


agagcagaag 


acagtggcaa 


480 




tgagagtgaa 


ggagaaatat 


cagcacttgt 


ggagatgggg 


gtggagatgg 


ggcaccatgc 


540 


r s : 
5 


tccttgggat 


gttgatgatc 


tgtagtgcta 


cagaaaaatt 


gtgggtcaca gtctattatg 


600 


6 

? — 


gggtacctgt 


gtggaaggaa 


gcaaccacca 


ctctattttg 


tgcatcagat 


gctaaagcat 


660 


~=b 


atgatacaga 


ggtacataat 


gtttgggcca 


cacatgcctg 


tgtacccaca 


gaccccaacc 


720 


Ms 
w 


cacaagaagt 


agtattggta 


aatgtgacag 


aaaattttaa 


catgtggaaa 


aatgacatgg . 


780 




tagaacagat 


gcatgaggat 


ataatcagtt 


tatgggatca 


aagcctaaag 


ccatgtgtaa 


840 




aattaacccc 


actctgtgtt 


agtttaaagt 


gcactgattt 


gaagaatgat 


actaatacca 


900 




atagtagtag 


cgggagaatg 


ataatggaga 


aaggagagat 


aaaaaactgc 


tctttcaata 


960 




tcagcacaag 


cataagaggt 


aaggtgcaga 


aagaatatgc 


atttttttat 


aaacttgata 


1020 




taataccaat 


agataatgat 


actaccagct 


atacgttgac 


aagttgtaac 


acctcagtca 


1080 




ttacacaggc 


ctgtccaaag 


gtatcctttg 


agccaattcc 


catacattat 


tgtgccccgg 


1140 




ctggttttgc 


gattctaaaa 


tgtaataata 


agacgttcaa 


tggaacagga 


ccatgtacaa 


1200 




atgtcagcac 


agtacaatgt 


acacatggaa 


ttaggccagt 


agtatcaact 


caactgctgt 


t o a c\ 
LZ bU 




taaatggcag 


tctggcagaa 


gaagaggtag 


taattagatc 


tgccaatttc 


acagacaatg 


1320 




ctaaaaccat 


aatagtacag 


ctgaaccaat 


ctgtagaaat 


taattgtaca 


agacccaaca 


1380 




acaatacaag 


aaaaagtatc 


cgtatccaga 


gaggaccagg 


gagagcattt 


gttacaatag 


1440 



gaaaaatagg aaatatgaga 
ctttaaaaca gatagatagc 
ttaagcagtc ctcaggaggg 
aatttttcta ctgtaattca 
gtactaaagg gtcaaataac 
aacaaattat aaacatgtgg 
gacaaattag atgttcatca 
gcaacaatga gtccgagatc 
gtgaattata taaatataaa 
caaagagaag agtggtgcag 
tgggagcagc aggaagcact 
aattattgtc tggtatagtg 
agcatctgtt gcaactcaca 
tggaaagata cctaaaggat 
tttgcaccac tgctgtgcct 
ggaataacat gacctggatg 
actccttaat tgaagaatcg 
tagataaatg ggcaagtttg 
aattattcat aatgatagta 
ctgtagtgaa tagagttagg 
tcccgagggg acccgacagg 
acagatccat tcgattagtg 
tgtgcctctt cagctaccac 
aacttctggg acgcaggggg 
ggagtcagga gctaaagaat 
ctgaggggac agatagggtt 
tacctagaag aataagacag 
tcaaaaagta gtgtggttgg 
gcagcagatg gggtgggagc 
agcaacacag cagctaacaa 



caagcacatt gtaacattag 
aaattaagag aacaatttgg 
gacccagaaa ttgtaacgca 
acacaactgt ttaatagtac 
actgaaggaa gtgacacaat 
caggaagtag gaaaagcaat 
aatattacag ggctgctatt 
ttcagacctg gaggaggaga 
gtagtaaaaa ttgaaccatt 
actagtgcag tgggaatagg 
atgggcgcag cgtcaatgac 
cagcagcaga acaatttgct 
gtctggggca tcaagcagct 
caacagctcc tggggatttg 
tggaatgcta gttggagtaa 
gagtgggaca gagaaattaa 
caaaaccagc aagaaaagaa 
tggaattggt ttaacataac 
ggaggcttgg taggtttaag 
cagggatatt caccattatc 
cccgaaggaa tagaagaaga 
aacggatcct tagcacttat 
cgcttgagag acttactctt 
tgggaagccc tcaaatattg 
agtgctgtta gcttgctcaa 
atagaagtag tacaaggagc 
ggcttggaaa ggattttgct 
atggcctgct gtaagggaaa 
agcatctcga gacctagaaa 
tgctgattgt gcctggctag 



tagagcaaaa tggaataaca 
aaataataaa acaataatct 
cagttttaat tgtggagggg 
ttggtttaat agtacttgga 
caccctccca tgcagaataa 
gtatgcccct cccatcagtg 
aacaagagat ggtggtaata 
tatgagggac aattggagaa 
aggagtagca cccaccaagg 
agctttgttc cttgggttct 
gctgacggta caggccagac 
gagggctatt gaggcgcaac 
ccaggcaaga atcctggctg 
gggttgctct ggaaaactca 
taaatctctg gaacagattt 
caattacaca agcttaatac 
tgaacaagaa ttattggaat 
aaattggctg tggtatataa 
aatagttttt gctgtacttt 
gtttcagacc cacctcccaa 
aggtggagag agagacagag 
ctgggacgat ctgcggagcc 
gattgtaacg aggattgtgg 
gtggaatctc ctacagtatt 
tgccacagct atagcagtag 
ttatagagct attcgccaca 
ataagatggg tggcaagtgg 
gaatgagacg agctgagcca 
aacatggagc aatcacaagt 
aagcacaaga ggaggaggag 





gtgggttttc 


cagtcacacc 


tcaggtacct 


ttaagaccaa 


tgacttacaa 


ggcagctgta 


3300 




gatcttagcc 


actttttaaa agaaaagggg ggactggaag 


ggctaattca 


ctcccaacga 


3360 




agacaagata 


tccttgatct gtggatctac 


cacacacaag 


gctacttccc 


tgattag 


3417 




<210> 22 
<211> 2950 
<212> DNA 

<213> Artificial sequence 












<220> 

<223> Modified Env/Nef from strain BH10 










<400> 22 
gaattcgcca 


ccatgggagt 


gaaggagaaa 


tatcagcact 


tgtggagatg 


ggggtggaga 


60 




tggggcacca 


tgctccttgg 


gatgttgatg 


atctgtagtg 


ctacagaaaa 


attgtgggtc 


120 




acagtctatt 


atggggtacc 


tgtgtggaag 


gaagcaacca 


ccactctatt 


ttgtgcatca 


180 


b 


gatgctaaag 


catatgatac 


agaggtacat 


aatgtttggg 


ccacacatgc 


ctgtgtaccc 


240 


acagacccca 


acccacaaga 


agtagtattg 


gtaaatgtga 


cagaaaattt 


taacatgtgg 


300 


O 


aaaaatgaca 


tggtagaaca 


gatgcatgag 


gatataatca 


gtttatggga 

ZJ zj zj zj 


tcaaagccta 


360 


w 


aagccatgtg 


taaaattaac 


cccactctgt 


gttagtttaa 


agtgcactga 


tttgaagaat 


420 


III 

m 


gatactaata 


ccaatagtag 


tagcgggaga 


atgataatgg 


agaaaggaga 


gataaaaaac 


480 


tgctctttca 


atatcagcac 


aagcataaga 


ggtaaggtgc 


agaaagaata 


tgcatttttt 


540 


0 


tataaacttg 


atataatacc 


aatagataat 


gatactacca 


gctatacgtt 


gacaagttgt 


600 


aacacctcag 


tcattacaca 


ggcctgtcca 


aaggtatcct 


ttgagccaat 


tcccatacat 


660 


- ■ 


tattgtgccc 


cggctggttt 


tgcgattcta 


aaatgtaata 


ataagacgtt 


caatggaaca 


720 




ggaccatgta 


caaatgtcag 


cacagtacaa 


tgtacacatg 


gaattaggcc 


agtagtatca 


780 




actcaactgc 


tgttaaatgg 


cagtctggca 


gaagaagagg 


tagtaattag 


atctgccaat 


840 




ttcacagaca 


atgctaaaac 


cataatagta 


cagctgaacc 


aatctgtaga 


aattaattgt 


900 




acaagaccca 


acaacaatac 


aagaaaaagt 


atccgtatcc 


agagaggacc 

ZJ ZJ ZJ ZJ 


agggagagca 


960 




tttgttacaa 


taggaaaaat 


aggaaatatg 


agacaagcac 


attgtaacat 


tagtagagca 


1020 




aaatggaata 


acactttaaa 


acagatagat 


agcaaattaa 


gagaacaatt 


tggaaataat 


1080 




aaaacaataa 


tctttaagca 


gtcctcagga 


ggggacccag 


aaattgtaac 


gcacagtttt 


1140 




aattgtggag 


gggaattttt 


ctactgtaat 


tcaacacaac 


tgtttaatag 


tacttggttt 






aatagtactt 


ggagtactaa 


agggtcaaat 


aacactgaag 


gaagtgacac 


aatcaccctc 


1260 




ccatgcagaa 


taaaacaaat 


tataaacatg 


tggcaggaag 


taggaaaagc 


aatgtatgcc 


1320 




cctcccatca 


gtggacaaat 


tagatgttca 


tcaaatatta 


cagggctgct 


attaacaaga 


1380 



y 
m 

3 

5 s 

o 

few 
: - 



gatggtggta 


atagcaacaa 


tgagtccgag 


atcttcagac 


ctggaggagg 


agatatgagg 


1440 


gacaattgga 


gaagtgaatt 


atataaatat 


aaagtagtaa 


aaattgaacc 


attaggagta 


1500 


gcacccacca 


aggcaaagag 


aagagtggtg 


cagactagtg 


cagtgggaat 


aggagctttg 


1560 


ttccttgggt 


tcttgggagc 


agcaggaagc 


actatgggcg 


cagcgtcaat 


gacgctgacg 


1620 


gtacaggcca 


gacaattatt 


gtctggtata 


gtgcagcagc 


agaacaattt 


gctgagggct 


1680 


attgaggcgc 


aacagcatct 


gttgcaactc 


acagtctggg 


gcatcaagca 


gctccaggca 


1740 


agaatcctgg 


ctgtggaaag 


atacctaaag 


gatcaacagc 


tcctggggat 


ttggggttgc 


1800 


tctggaaaac 


tcatttgcac 


cactgctgtg 


ccttggaatg 


ctagttggag 


taataaatct 


1860 


ctggaacaga 


tttggaataa 


catgacctgg 


atggagtggg 


acagagaaat 


taacaattac 


1920 


acaagcttaa 


tacactcctt 


aattgaagaa 


tcgcaaaacc 


agcaagaaaa 


gaatgaacaa 


1980 


gaattattgg 


aattagataa 


atgggcaagt 


ttgtggaatt 


ggtttaacat 


aacaaattgg 


2040 


ctgtggtata 


taaaattatt 


cataatgata 


gtaggaggct 


tggtaggttt 


aagaatagtt 


2100 


tttgctgtac 


tttctgtagt 


gaatagagtt 


aggcagggat 


attcaccatt 


atcgtttcag 


2160 


acccacctcc 


caatcccgag 


gggacccgac 


aggcccgaag 


gaatagaaga 


agaaggtgga 


2220 


gagagagaca 


gagacagatc 


cattcgatta 


gtgaacggat 


ccttagcact 


tatctgggac 


2280 


gatctgcgga 


gcctgtgcct 


cttcagctac 


caccgcttga 


gagacttact 


cttgattgta 


2340 


acgaggattg 


tggaacttct 


gggacgcagg 


gggtgggaag 


ccctcaaata 


ttggtggaat 


2400 


ctcctacagt 


attggagtca 


ggagctaaag 


aatagtgctg 


ttagcttgct 


caatgccaca 


2460 


gctatagcag 


tagctgaggg 


gacagatagg 


gttatagaag 


tagtacaagg 


agcttataga 


2520 


gctattcgcc 


acatacctag 


aagaataaga 


cagggcttgg 


aaaggatttt 


gctataagat 


2580 


gggtggcaag 


tggtcaaaaa 


gtagtgtggt 


tggatggcct 


gctgtaaggg 


aaagaatgag 


2640 


acgagctgag 


ccagcagcag 


atggggtggg 


agcagcatct 


cgagacctag 


aaaaacatgg 


2700 


agcaatcaca 


agtagcaaca 


cagcagctaa 


caatgctgat 


tgtgcctggc 


tagaagcaca 


2760 


agaggaggag gaggtgggtt 


ttccagtcac 


acctcaggta 


cctttaagac 


caatgactta 


2820 


caaggcagct 


gtagatctta 


gccacttttt 


aaaagaaaag 


gggggactgg 


aagggctaat 


2880 


tcactcccaa 


cgaagacaag 


atatccttga 


tctgtggatc 


taccacacac 


aaggctactt 


2940 


ccctgattag 












2950 



<210> 23 

<211> 2747 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> Modified Env/Tat from strain BH10 
<400> 23 



id 
yi 

s 



a 

B S 

£S3 



gaattctgca 


acaactgctg 


tttatccatt 


ttcagaattg 


ggtgtcgaca 


tagcagaata 


60 


ggcgttactc 


qacaqaqqaq 


agcaagaaat 


ggagccagta 


gatcctagac 


tagagccctg 


120 


gaagcatcca 


ggaagtcagc 


ctaaaactgc 


ttgtaccaat 


tgctattgta 


aaaagtgttg 


180 


ctttcattgc 


caagtttgtt 


tcataacaaa 


agccttaggc 


atctcctatg 


gcaggaagaa 


240 


qcqqaqacaq 

3 3.3^3 3 


cgacgaagac 


ctcctcaagg 


cagtcagact 


catcaagttt 


ctctatcaaa 


300 


gcagtaagta 


gtacatgtaa 


tgcaacctat 


acaaatagca 


atagtagcat 


tagtagtagc 


360 


aataataata 


gcaatagttg 


tgtggtccat 


agtaatcata 


gaatatagga 


aaatattaag 


420 


acaaagaaaa 


atagacaggt 


taattgatag 


actaatagaa 


agagcagaag 


acagtggcaa 


480 


tqacraatqaa 


ggagaaatat 


cagcacttgt 


ggagatgggg 

—J —J — ' mJ mJ mi mi 


gtggagatgg 


ggcaccatgc 


540 


tccttqqqat 


qttqatqatc 


tgtagtgcta 


cagaaaaatt 


gtgggtcaca 


gtctattatg 


600 


qqqtacctqt 


qtqqaaqqaa 

3 3 3 3 3 


gcaaccacca 


ctctattttg 


tgcatcagat 


gctaaagcat 


660 


atgatacaga 


ggtacataat 


gtttqqqcca 

3 3 3 3 


cacatgcctg 


tgtacccaca 


gaccccaacc 


720 


cacaagaagt 


agtattggta 


aatgtgacag 


aaaattttaa 


catgtggaaa 


aatgacatgg 


780 


tagaacagat 


qcatqaqqat 


ataatcagtt 


tatgggatca 


aagcctaaag 


ccatgtgtaa 


840 


aattaacccc 


actctgtgtt 


agtttaaagt 


gcactgattt 


gaagaatgat 


actaatacca 


900 


a tag tag tag 


cqqqaqaatq 


ataatggaga 


aaggagagat 


aaaaaactgc 


tctttcaata 


960 


tcagcacaag 


cataagaggt 


aaggtgcaga 


aagaatatgc 


atttttttat 


aaacttgata 


1020 


taataccaat 


agataatgat 


actaccagct 


atacgttgac 


aagttgtaac 


acctcagtca 


1080 


ttacacaggc 


ctgtccaaag 


gtatcctttg 


agccaattcc 


catacattat 


tgtgccccgg 


1140 


ctggtt ttgc 


gattctaaaa 


tgtaataata 


agacgttcaa 


tggaacagga 


ccatgtacaa 


1200 


atatcaacac 


agtacaatgt 


acacatggaa 


ttaggccagt 


agtatcaact 


caactgctgt 


1260 


taaataacaa 


tctggcagaa 


qaaqaqqtaq 

3 3 3 3 ^3 


taattagatc 


tgccaatttc 


acagacaatg 


1320 


ctaaaaccat 


aatagtacag 


ctgaaccaat 


ctgtagaaat 


taattgtaca 


agacccaaca 


1380 


acaatacaag 


aaaaagtatc 


cgtatccaga 


gaggaccagg 


gagagcattt 


gttacaatag 


T A A A 
144U 


gaaaaatagg 


aaatatgaga 


caagcacatt 


gtaacattag 


tagagcaaaa 


tggaataaca 


1500 


ctttaaaaca 


gatagatagc 


aaattaagag 


aacaatttgg 


aaataataaa 


acaataatct 


1560 


ttaagcagtc 


ctcaggaggg 


gacccagaaa 


ttgtaacgca 


cagttttaat 


tgtggagggg 


1620 


aatttttcta 


ctgtaattca 


acacaactgt 


ttaatagtac 


ttggtttaat 


agtacttgga 


1680 


gtactaaagg 


gtcaaataac 


actgaaggaa 


gtgacacaat 


caccctccca 


tgcagaataa 


1740 



□ 

yj 



m 



aacaaattat 


aaacatgtgg 


caggaagtag 


gaaaagcaat 


gtatgcccct 


cccatcagtg 


1800 


gacaaattag 


atgttcatca 


aatattacag 


ggctgctatt 


aacaagagat 


ggtggtaata 


1860 


gcaacaatga 


gtccgagatc 


ttcagacctg 


gaggaggaga 


tatgagggac 


aattggagaa 


1920 


gtgaattata 


taaatataaa 


gtagtaaaaa 


ttgaaccatt 


aggagtagca 


cccaccaagg 


1980 


caaagagaag 


agtggtgcag 


actagtgcag 


tgggaatagg 


agctttgttc 


cttgggttct 


2040 


tgggagcagc 


aggaagcact 


atgggcgcag 


cgtcaatgac 


gctgacggta 


caggccagac 


2100 


aattattgtc 


tggtatagtg 


cagcagcaga 


acaatttgct 


gagggctatt 


gaggcgcaac 


2160 


agcatctgtt 


gcaactcaca 


gtctggggca 


tcaagcagct 


ccaggcaaga 


atcctggctg 


2220 


tggaaagata 


cctaaaggat 


caacagctcc 


tggggatttg 


gggttgctct 


ggaaaactca 


2280 


tttgcaccac 


tgctgtgcct 


tggaatgcta 


gttggagtaa 


taaatctctg gaacagattt 


2340 


ggaataacat 


gacctggatg 


gagtgggaca 


gagaaattaa 


caattacaca 


agcttaatac 


2400 


actccttaat 


tgaagaatcg 


caaaaccagc 


aagaaaagaa 


tgaacaagaa 


ttattggaat 


2460 


tagataaatg 


ggcaagtttg 


tggaattggt 


ttaacataac 


aaattggctg 


tggtatataa 


2520 


aattattcat 


aatgatagta 


ggaggcttgg 


taggtttaag 


aatagttttt gctgtacttt 


2580 


ctgtagtgaa 


tagagttagg 


cagggatatt 


caccattatc 


gtttcagacc 


cacctcccaa 


2640 


tcccgagggg 


acccgacagg 


cccgaaggaa 


tagaagaaga 


aggtggagag agagacagag 


2700 


acagatccat 


tcgattagtg 


aacggatcct 


tagcacttat 


ctggtaa 




2747 



<210> 24 
<211> 2583 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Modified Env 
<400> 24 

gaattcgcca ccatgggagt gaaggagaaa tatcagcact tgtggagatg ggggtggaga 60 
tggggcacca tgctccttgg gatgttgatg atctgtagtg ctacagaaaa attgtgggtc 12 0 
acagtctatt atggggtacc tgtgtggaag gaagcaacca ccactctatt ttgtgcatca 180 
gatgctaaag catatgatac agaggtacat aatgtttggg ccacacatgc ctgtgtaccc 24 0 
acagacccca acccacaaga agtagtattg gtaaatgtga cagaaaattt taacatgtgg 300 
aaaaatgaca tggtagaaca gatgcatgag gatataatca gtttatggga tcaaagccta 360 
aagccatgtg taaaattaac cccactctgt gttagtttaa agtgcactga tttgaagaat 42 0 
gatactaata ccaatagtag tagcgggaga atgataatgg agaaaggaga gataaaaaac 4 80 



a 



ui 
u 
W 

m 

2 

i z 



tgctctttca 


atatcagcac 


aagcataaga 


ggtaaggtgc 


agaaagaata 


tgcatttttt 


540 


tataaacttg 


atataatacc 


aatagataat 


gatactacca 


getataegtt 


gacaagttgt 


600 


aacacctcag 


tcattacaca 


ggcctgtcca 


aaggtatcct 


ttgagccaat 


tcccatacat 


660 


tattgtgccc 


cqqctqqttt 


tgegattcta 


aaatgtaata 


ataagaegtt 


caatggaaca 


720 


ggaccatgta 


caaatgtcag 


cacagtacaa 


tgtacacatg 


gaattaggee 


agtagtatca 


780 


actcaactgc 


tqttaaatqq 


cagtctggca 


gaaqaagagq 


tagtaattag 


atetgecaat 


840 


ttcacagaca 


atgetaaaac 


cataatagta 


cagctgaacc 


aatctgtaga 


aattaattgt 


900 


acaagaccca 


acaacaatac 


aagaaaaagt 


atccgtatcc 


agagaggacc 


agggagagca 


960 


tttgttacaa 


taggaaaaat 


aggaaatatg 


agacaagcac 


attgtaacat 


tagtagagca 


1020 


aaatggaata 


acactttaaa 


acagatagat 


agcaaattaa 


gagaacaatt 


tggaaataat 


1080 


aaaacaataa 


tctttaagca 


qtcctcaqqa 

ZD ZD ZZ3 


qqqqacccaq 


aaattgtaac 


gcacagtttt 


1140 


aattatcraacr 


gggaattttt 


ctactgtaat 


tcaacacaac 


tgtttaatag 


tacttggttt 


1200 


aatagtactt 


ggagtactaa 


agggtcaaat 


aacactgaag 


gaagtgacac 


aatcaccctc 


1260 


ccatgcagaa 


taaaacaaat 


tataaacatg 


tggcaggaag 


taggaaaagc 


aatgtatgcc 


1320 


cctcccatca 


gtggacaaat 


tagatgttca 


tcaaatatta 


cagggctget 


attaacaaga 


1380 


crataQtQcrta 


atagcaacaa 


tqaqtccqaq 

ZD ZD -3 


atcttcagac 


ctggaggagg 


agatatgagg 


1440 


aacaattaaa 


gaagtgaatt 


atataaatat 


aaagtagtaa 


aaattgaacc 


attaggagta 


1500 


C4 w M 


aggcaaagag 


aaqaqtqqtq 

«_A '-'ZD ZD V-^-J 


cagagagaaa 


aaagagcagt 


gggaatagga 


1560 


gctttgttcc 


ttgggttctt 


qqqaqcaqca 


ggaagcacta 


tgggegcage 


gtcaatgacg 


1620 


ctgacggtac 


aggecagaca 


attattgtct 


ggtatagtgc 


agcagcagaa 


caatttgetg 


1680 


aQQQCtattQ 


aggegcaaca 


gcatctgttg 


caactcacag 


tctggggcat 


caagcagctc 


1740 


raaacaaaaa 


tcctaactot 


ggaaagatac 


ctaaaggatc 


aacagctcct 


ggggatttgg 


1800 


acrt terete ta 


gaaaactcat 


ttgcaccact 


gctgtgcctt 


ggaatgetag 


ttggagtaat 


1860 


aaatctctaa 


aacaaattta 


gaataacatg 


acctqqatqq 


aqtqqqacaq 

ZD ZD ZD ZD 


agaaattaac 


1920 


aattacacaa 


gcttaataca 


ctccttaatt 


gaagaatege 


aaaaccagca 


agaaaagaat 


1980 


gaacaagaat 


tattggaatt 


aqataaatqq 


gcaagtttgt 


qgaattggtt 


taacataaca 


2040 


aattaactat 


ggtatataaa 


attattcata 


atgatagtag 


gaggcttggt 


aggtttaaga 


2100 


atagtttttg 


ctgtactttc 


tgtagtgaat 


agagttaggc 


agggatattc 


accattatcg 


2160 


tttcagaccc 


acctcccaat 


cccgagggga 


cccgacaggc 


ccgaaggaat 


agaagaagaa 


2220 


ggtggagaga 


gagacagaga 


cagatccatt 


cgattagtga 


aeggatcett 


agcacttatc 


2280 


tgggacgatc 


tgeggagect 


gtgcctcttc 


agctaccacc 


gcttgagaga 


cttactcttg 


2340 



attgtaacga ggattgtgga acttctggga cgcagggggt gggaagccct caaatattgg 

tggaatctcc tacagtattg gagtcaggag ctaaagaata gtgctgttag cttgctcaat 

gccacagcta tagcagtagc tgaggggaca gatagggtta tagaagtagt acaaggagct 

tatagagcta ttcgccacat acctagaaga ataagacagg gcttggaaag gattttgcta 
taa 



<210> 25 
<211> 108 
<212> DNA 

<213> Human immunodeficiency virus type 1 
<400> 25 

tgtacaagac ccaacaacaa tacaagaaaa agtatccgta tccagagagg accagggaga 
gcatttgtta caataggaaa aataggaaat atgagacaag cacattgt 



<210> 26 
<211> 105 
<212> DNA 

<213> Human immunodeficiency virus type 1 
<400> 26 

tgtaccagac ctaacaacaa tacaagaaaa agtgtacgta taggaccagg acaaacattc 
tatgcaacag gtgatataat aggggatata agacaagcac attgt 



<210> 27 
<211> 105 
<212> DNA 

<213> Human immunodeficiency virus type 1 
<400> 27 

tgtacgagac ccaacaataa tacaagaaaa agtataagga taggaccagg acaagcattc 
tatgcaacag gagaaataat aggagatata agacaagcac attgt 



<210> 28 
<211> 102 
<212> DNA 

<213> Human immunodeficiency virus type 1 
<400> 28 

tgcacaaggc cctacaacaa tataagacaa aggaccccca taggactagg gcaagcactc 
tatacaacaa gaagaataga agatataaga agagcacatt gt 



<210> 29 

<211> 105 

<212> DNA 

<213> Human immunodeficiency virus type 1 



<400> 29 

tgtaccagac cctccaccaa tacaagaaca agtatacgta taggaccagg acaagtattc 



60 



tatagaacag gagacataac aggagatata agaaaagcat* attgt 105 

<210> 30 
<211> 105 
<212> DNA 

<213> Human immunodeficiency virus type 1 
<400> 30 

tgtacaagac ccaacaacaa tacaagaaaa agaatatctt taggaccagg acgagtattt 60 
tatacagcag gagaaataat aggagacatc agaaaggcac attgt 105 

<210> 31 

<211> 105 

<212> DNA 

<213> Human immunodeficiency virus type 1 



<400> 31 

tgtaccagac ctaataacaa tacaagaaaa agtataactt ttgcaccagg acaagcgctc 



<400> 32 



60 



tatgcaacag gtgaaataat aggagatata agacaagcac attgt 105 

n 

W <210> 32 

U <211> 2562 

y <212> DNA 

|Jj <213> Artificial sequence 

?. <220> 

<223> Env with multi-clade V3 loops 



atgagagtga 


aggagaaata 


tcagcacttg 


tggagatggg ggtggagatg gggcaccatg 


60 


ctccttggga 


tgttgatgat 


ctgtagtgct 


acagaaaaat 


tgtgggtcac 


agtctattat 


12 0 


ggggtacctg 


tgtggaagga 


agcaaccacc 


actctatttt 


gtgcatcaga 


tgctaaagca 


180 


tatgatacag 


aggtacataa 


tgtttgggcc 


acacatgcct 


gtgtacccac 


agaccccaac 


240 


ccacaagaag 


tagtattggt 


aaatgtgaca 


gaaaatttta 


acatgtggaa 


aaatgacatg 


300 


gtagaacaga 


tgcatgagga 


tataatcagt 


ttatgggatc 


aaagcctaaa 


gccatgtgta 


360 


aaattaaccc 


cactctgtgt 


tggagctggt 


agttgtaaca 


cctcagtcat 


tacacaggcc 


420 


tgtccaaagg 


tatcctttga 


gccaattccc 


atacattatt 


gtgccccggc 


tggttttgcg 


480 


attctaaaat 


gtaataataa 


gacgttcaat 


ggaacaggac 


catgtacaaa 


tgtcagcaca 


540 


gtacaatgta 


cacatggaat 


taggccagta 


gtatcaactc 


aactgctgtt 


aaatggcagt 


600 


ctggcagaag 


aagaggtagt 


aattagatct 


gccaatttca 


cagacaatgc 


taaaaccata 


660 


atagtacagc 


tgaaccaatc 


tgtagaaatt 


aattgtacaa 


gacccaacaa 


caatacaaga 


720 



u 

hi 
Id 
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ddddy l>Qt^^ 


atatccaoaa 

y ^ d i— u ^» BM 


aaaaccaaaci 

^* wc 


agagcatttg 


ttacaatagg 


aaaaatagga 


780 


aatataaaac 


aaacacatta 


tctcaacrtat 


accagaccta 


acaacaatac 


aaqaaaaaqt 


840 


crt"apcft"at"acr 

y L.u.L.y l. d l. d y 


cr a p c a crcr a p a 

ydv^^dyydod 


aacattctat 


gcaacaggtg 


atataatagg 


qqatataacra 


900 


caaocacatt 


attatacaao 


acccaacaat 


aatacaagaa 


aaagtataag 


gataggacca 


960 


craacaaacat 

C*. C* 1 lAWiM \^ C*. K- 


tetatgeaac 


aggagaaata 


ataggagata 


taagacaagc 


acattgttgc 


1020 


acaaggccct 


acaacaatat 


aagacaaagg 


acccccatag 


gactagggca 


agcactctat 


1080 


acaacaaaaa 


aaataaaaaa 

c* c* C* c*^n w *-*■ 


tataagaaga 


gcacattgtt 


gtaccagacc 


ctccaccaat 


1140 


a raaaaacaa 

d v«. ddM d d *w dd 


atatacatat 

V* C* C* *-4. C-* w W. C* 


aacraccaaaa 


caagtattct 


atagaacagg 


agacataaca 


1200 


crcr a era t - a t" a a 

MM CI y CI l> d t» dd 


aaaaaacata 

C* dcmM C* V* C* w*. 


ttgtggatcc 


tgtacaagac 


ccaacaacaa 


tacaagaaaa 


1260 


aaaat*atctt 

Ci Ci C» C4. W C Cr 


tacraaccaacr 


acgagtattt 


tatacagcag 


gagaaataat 


aggagacatc 


1320 


aaaaaaocac 


attattatac 


cagacctaat 


aacaatacaa 


gaaaaagtat 


aacttttgea 


1380 


ccaaaacaaa 


cgctctatgc 


aacaggtgaa 


ataataggag 


atataagaca 


agcacattgt 


1440 


ctcaaaaaca 


ttaataaaac 

V** w C4»^"j Co W*. ~J 


aaaatggaat 


aacactttaa 


aacagataga 


tagcaaatta 


1500 


dy dy d d ^ c*. d i_ 


t* taaaaataa 


taaaacaata 


atctttaagc 


agtcctcagg 


aggggaccca 


1560 


rra aat"t~crt"aa 

y ddd L, l_ y L. CICl 


racacaattt 

^ M w d w dM Lp w 


taattataaa 

C* C4> C4w Co ^-j C- ^1 L-4, 


ggggaatttt 


tctactgtaa 


ttcaacacaa 


1620 


pt"crt"i~t~aata 
^ ^-y " dd w d 


atacttaatt 


taatagtact 


tggagtacta 


aagggtcaaa 


taacactgaa 


1680 


crcr a acrtcrapa 


caatcaccct 


cccatgcaga 


ataaaacaaa 


ttataaacat 


qtqqcaqqaa 

_3 _3Z3 — »— ' 


1740 


atacrcraaaaQ 


caatatatac 


ccctcccatc 


agtggacaaa 


ttagatgttc 


atcaaatatt 


1800 


a p a crcrcrp t" a p 
dv_» dy y y v-« ty v«* 


fattaacaaa 


acra t era fc crcr t 

dy d uyy uyy v- 


aatagcaaca 


atgagtccga 


gatcttcaga 


1860 


ppf crcracrcracr 


era era t" a t" era a 

ydydL»dL.ydy 




agaagtgaat 


tatataaata 


taaagtagta 


1920 


aaaaffaaar 

dddd L» ^»y d C*. 


ra ttacrcraat 

^— - d k~ C d>j d^J \— 


agcacccacc 


aaggcaaaga 


qaaaacjtQQt 


qcaqactaqt 


1980 


crpacrt~crcrcra a 

y l. dy i_y y y d d 


t aaaaact tt 


crt t cc t tcrcrcr 


ttcttgggag 


cagcaggaag 


cactatgggc 


2040 


y v_> dy t_«y l> wdd 


1~cracactcrac 

uM d 3 dV* 


aatacaaacc 


agacaattat 


tgtctggtat 


agtgcagcag 


2100 


p a era a p a a t~ t~ 

^_ dy d d V_ d d L- L. 


t"crpt"aaaacrc 

uy \~« i— y dy y y v— 


tattaaacrccr 


caacagcatc 


tgttgcaact 


cacagtctgg 


2160 




a cr r* t" r* r* a cr cr r 1 


a acra a t* rc t a 

d d^J d d V- V— \— ^ 


crctcrtcrcraaa 


aatacctaaa 


ggatcaacag 


2220 




i~ t" t - ercrcrcrt" t" cr 

LLL yyyy LL y 


pi~ p1"crcraaaa 

i— w y y dddd 


p tcatt tcrca 

V— ' C- V-' C*. C" C» Co C>- C4> 


ccactgctgt 


gccttggaat 


2280 


rfpf acrt" "her era 
y ^ Lay l. t_y y d 


crt"aat*aaatc 

y l> dd c ddd >— • 


tctaaaacaa 

C> C^ C*> C* C C4> v-i 


atttggaata 


acatgacctg 


qatqqaqtqq 


2340 


gacagagaaa 


ttaacaatta 


cacaagctta 


atacactcct 


taattgaaga 


ategcaaaac 


2400 


cagcaagaaa 


agaatgaaca 


agaattattg 


gaattagata 


aatgggcaag 


tttgtggaat 


2460 


tggtttaaca 


taacaaattg 


gctgtggtat 


ataaaatcgt 


ggctgetget 


gctcctgctc 


2520 


tccctctccc 


tcctccaggc 


caeggattte 


atgtccctgt 


ga 




2562 



<210> 33 
<211> 853 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Modified Env with multi-clade V3 loops 
<400> 33 

Met Arg Val Lys Glu Lys Tyr Gin His Leu Trp Arg Trp Gly Trp Arg 
15 10 15 



Trp Gly Thr Met Leu Leu Gly Met Leu Met lie Cys Ser Ala Thr Glu 
20 25 30 



Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala 
35 40 45 



Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu 
50 55 60 



Val His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn 
65 70 75 80 



Pro Gin Glu Val Val Leu Val Asn Val Thr Glu Asn Phe Asn Met Trp 
85 90 95 



Lys Asn Asp Met Val Glu Gin Met His Glu Asp He He Ser Leu Trp 
100 105 110 



Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Gly 
115 120 125 



Ala Gly Ser Cys Asn Thr Ser Val He Thr Gin Ala Cys Pro Lys -Val 
130 ' 135 140 



Ser Phe Glu Pro He Pro He His Tyr Cys Ala Pro Ala Gly Phe Ala 
145 150 155 160 



He Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys Thr 
165 170 175 



Asn Val Ser Thr Val Gin Cys Thr His Gly He Arg Pro Val Val Ser 
180 185 190 



Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val He 



195 



200 



205 



Arg Ser Ala Asn Phe Thr Asp Asn Ala Lys Thr lie lie Val Gin Leu 
210 215 220 



Asn Gin Ser Val Glu lie Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg 
225 230 235 240 



Lys Ser lie Arg lie Gin Arg Gly Pro Gly Arg Ala Phe Val Thr lie 
245 250 255 



Gly Lys He Gly Asn Met Arg Gin Ala His Cys Leu Gly Cys Thr Arg 
260 265 270 



Pro Asn Asn Asn Thr Arg Lys Ser Val Arg He Gly Pro Gly Gin Thr 
275 280 285 



Phe Tyr Ala Thr Gly Asp He He Gly Asp He Arg Gin Ala His Cys 
290 295 300 



Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser He Arg He Gly Pro 
305 310 315 320 



Gly Gin Ala Phe Tyr Ala Thr Gly Glu He He Gly Asp He Arg Gin 
325 330 335 



Ala His Cys Cys Thr Arg Pro Tyr Asn Asn He Arg Gin Arg Thr Pro 
340 345 350 



He Gly Leu Gly Gin Ala Leu Tyr Thr Thr Arg Arg He Glu Asp He 
3 55 360 365 



Arg Arg Ala His Cys Cys Thr Arg Pro Ser Thr Asn Thr Arg Thr Ser 
370 375 380 



He Arg He Gly Pro Gly Gin Val Phe Tyr Arg Thr Gly Asp He Thr 
385 390 395 400 



Gly Asp He Arg Lys Ala Tyr Cys Gly Ser Cys Thr Arg Pro Asn Asn 
405 * 410 415 



Asn Thr Arg Lys Arg He Ser Leu Gly Pro Gly Arg Val Phe Tyr Thr 
420 425 430 



Ala Gly Glu He He Gly Asp He Arg Lys Ala His Cys Cys Thr Arg 
435 440 445 



Pro Asn Asn Asn Thr Arg Lys Ser lie Thr Phe Ala Pro Gly Gin Ala 
450 455 460 



Leu Tyr Ala Thr Gly Glu lie lie Gly Asp lie Arg Gin Ala His Cys 
465 470 475 480 



Leu Gly Asn lie Ser Arg Ala Lys Trp Asn Asn Thr Leu Lys Gin lie 
485 490 495 



Asp Ser Lys Leu Arg Glu Gin Phe Gly Asn Asn Lys Thr lie lie Phe 
500 505 510 



Lys Gin Ser Ser Gly Gly Asp Pro Glu lie Val Thr His Ser Phe Asn 
515 520 525 



Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gin Leu Phe Asn Ser 
530 " 535 540 



Thr Trp Phe Asn Ser Thr Trp Ser Thr Lys Gly Ser Asn Asn Thr Glu 
545 550 555 560 



Gly Ser Asp Thr lie Thr Leu Pro Cys Arg lie Lys Gin lie lie Asn 
565 570 575 



Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala Pro Pro lie Ser Gly 
580 585 590 



Gin lie Arg Cys Ser Ser Asn lie Thr Gly Leu Leu Leu Thr Arg Asp 
595 600 605 



Gly Gly Asn Ser Asn Asn Glu Ser Glu lie Phe Arg Pro Gly Gly Gly 
610 615 620 



Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val 
625 630 635 640 



Lys lie Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val 
645 650 655 



Val Gin Thr Ser Ala Val Gly lie Gly Ala Leu. Phe Leu Gly Phe Leu 
660 665 670 



Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Met Thr Leu Thr Val 
675 680 685 



Gin Ala Arg Gin Leu Leu Ser Gly He Val Gin Gin Gin Asn Asn Leu 
690 " 695 700 



Leu Arg Ala He Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp 
705 710 715 720 



Gly He Lys Gin Leu Gin Ala Arg He Leu Ala Val Glu Arg Tyr Leu 
725 730 735 



Lys Asp Gin Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu He 
740 745 750 



Cys Thr Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu 
755 760 765 



Glu Gin He Trp Asn Asn Met Thr Trp Met Glu Trp Asp- Arg Glu He 
770 775 780 



Asn Asn Tyr Thr Ser Leu He His Ser Leu He Glu Glu Ser Gin Asn 
785 790 795 800 



Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys Trp Ala 
805 810 815 



Ser Leu Trp Asn Trp Phe Asn He Thr Asn Trp Leu Trp Tyr He Lys 
820 825 830 



Ser Trp Leu Leu Leu Leu Leu Leu Ser Leu Ser Leu Leu Gin Ala Thr 
835 840 845 



Asp Phe Met Ser Leu 
850 



<210> 34 
<211> 1092 
<212> DNA 

<213> Human immunodeficiency virus type 1 
<400> 34 

atgggtgcga gagcgtcagt attaagcggg ggagaattag atcgatggga aaaaattcgg 
ttaaggccag ggggaaagaa aaaatataaa ttaaaacata tagtatgggc aagcagggag 
ctagaacgat tcgcagttaa tcctggcctg ttagaaacat cagaaggctg tagacaaata 
ctgggacagc tacaaccatc ccttcagaca ggatcagaag aacttagatc attatataat 
acagtagcaa ccctctattg tgtgcatcaa aggatagaga taaaagacac caaggaagct 



o 

n 



s 



ttagacaaga 


tagaggaaga 


gcaaaacaaa 


agtaagaaaa aagcacagca 


agcagcagct 


360 


qacacaqqac 


acagcagtca 


ggtcagccaa 


aattacccta tagtgcagaa 


catccagggg 


420 


caaatggtac 


atcaggccat 


atcacctaga 


actttaaatg catgggtaaa 


agtagtagaa 


480 


gagaaggctt 


tcagcccaga 


agtaataccc 


atgttttcag cattatcaga 


aggagccacc 


540 


ccacaagatt 


taaacaccat 


gctaaacaca 


gtggggggac atcaagcagc 


catgcaaatg 


600 


ttaaaagaga 


ccatcaatga 


ggaaqctgca 

ZJ ZJ ZJ — > 


gaatgggata gagtacatcc 


agtgcatgca 


660 


qqqcctattg 


caccaggcca 


gatgagagaa 


ccaaggggaa gtgacatagc 


aggaactact 


720 


agtacccttc 


aggaacaaat 


aggatggatg 

^ ZJ ZJ ZJ ZJ ZJ 


acaaataatc cacctatccc 


agtaggagaa 


780 


atttataaaa 


gatggataat 


cctgggatta 


aataaaatag taagaatgta 


tagccctacc 


840 


agcattctgg 


acataagaca 


aggaccaaaa 


gaacctttta gagactatgt 


agaccggttc 


900 


tataaaactc 


taagagccga 


gcaagcttca 


caggaggtaa aaaattggat 


gacagaaacc 


you 


ttgttggtcc 


aaaatgcgaa 


cccagattgt 


aagactattt taaaagcatt 


gggaccagcg 


1020 


gctacactag 


aagaaatgat 


gacagcatgt 


cagggagtag gaggacccgg 


ccataaggca 


1080 


agagttttgt 


aa 








1092 



<210> 35 

<211> 1179 

<212> DNA 

«i <213> Human immunodeficiency virus type 1 



<400> 35 



atgagagtga 


aggagaaata 


tcagcacttg 


tggagatggg ggtggagatg 


gggcaccatg 


60 


ctccttggga 


tgttgatgat 


ctgtagtgct 


ggtgcgagag 


cgtcagtatt 


aagcggggga 


120 


gaattagatc 


gatgggaaaa 


aattcggtta 


aggccagggg 


gaaagaaaaa 


atataaatta 


180 


aaacatatag 


tatgggcaag 


cagggagcta 


gaacgattcg 


cagttaatcc 


tggcctgtta 


240 


gaaacatcag 


aaggctgtag 


acaaatactg 


ggacagctac 


aaccatccct 


tcagacagga 


300 


tcagaagaac 


ttagatcatt 


atataataca 


gtagcaaccc 


tctattgtgt 


gcatcaaagg 


360 


atagagataa 


aagacaccaa 


ggaagcttta 


gacaagatag 


aggaagagca 


aaacaaaagt 


420 


aagaaaaaag 


cacagcaagc 


agcagctgac 


acaggacaca 


gcagtcaggt 


cagccaaaat 


480 


taccctatag 


tgcagaacat 


ccaggggcaa 


atggtacatc 


aggccatatc 


acctagaact 


540 


ttaaatgcat 


gggtaaaagt 


agtagaagag 


aaggctttca 


gcccagaagt 


aatacccatg 


600 


ttttcagcat 


tatcagaagg 


agccacccca 


caagatttaa 


acaccatgct 


aaacacagtg 


660 


gggggacatc 


aagcagccat 


gcaaatgtta 


aaagagacca 


tcaatgagga 


agctgcagaa 


720 


tgggatagag 


tacatccagt 


gcatgcaggg 


cctattgcac 


caggccagat 


gagagaacca 


780 





^yyyy «»y «-y 


arataacaoa 

GL CI *— CI w 3 J 


aactactagt 


acccttcagg 


aacaaatagg 


atggatgaca 


840 






rtatcccaat 


aggagaaatt 


tataaaagat 


ggataatcct 


gggattaaat 


900 




aaaat'aat'aa 


aaatatataa 


ccctaccagc 


attctggaca 


taagacaagg 


accaaaagaa 


960 




ccttttagag 


actatgtaga 


ccggttctat 


aaaactctaa 


gagccgagca 


agcttcacag 


1020 




gaggtaaaaa 


attggatgac 


agaaaccttg 


ttggtccaaa 


atgcgaaccc 


agattgtaag 


1080 




actattttaa 


aagcattggg 


accagcggct 


acactagaag 


aaatgatgac 


agcatgtcag 


1140 




ggagtaggag 


gacccggcca 


taaggcaaga gttttgtaa 






1179 




<210> 36 
<211> 1308 
<212> DNA 

<213> Human immunodeficiency virus type 1 










<400> 36 
atgagagtga 


aggagaaata 


tcagcacttg 


tggagatggg 


qgtggagatg 


gggcaccatg 


60 




ctccttggga 


tgttgatgat 


ctgtagtgct 


ggtgcgagag 


cgtcagtatt 


aagcggggga 


120 


OO 


gaattagatc 


gatgggaaaa 


aattcggtta 


aggccagggg 


gaaagaaaaa 


atataaatta 


180 


yj 


aaacatatag 


tatgggcaag 


cagggagcta 


gaacgattcg 


cagttaatcc 


tggcctgtta 


240 


d 


gaaacatcag 


aaggctgtag 


acaaatactg 


ggacagctac 


aaccatccct 


tcagacagga 


300 


hi 

w 


tcagaagaac 


ttagatcatt 


atataataca 


gtagcaaccc 


tctattgtgt 


gcatcaaagg 


360 


3 


a tag ay a. taa 


cl ciy a. L. cL t- O cl a 


y y ctcty u L L La 




aggaagagca 


aaacaaaagt 


420 


e ; 


aagaaaaaag 


cacagcaagc 


agcagctgac 


acaggacaca 


qcaqtcaqqt 


cagccaaaat 


480 


f 5 

s . 


taccctatag 


tgcagaacat 


ccaggggcaa 


atggtacatc 


aggccatatc 


acctagaact 


540 


M> 


ttaaatgcat 


gggtaaaagt 


agtagaagag 


aaggctttca 


gcccagaagt 


aatacccatg 


600 


ttttcagcat 


tatcagaagg 


agccacccca 


caagatttaa 


acaccatgct 


aaacacagtg 


660 




gggggacatc 


aagcagccat 


gcaaatgtta 


aaagagacca 


tcaatgagga agctgcagaa 


720 




tgggatagag 


tacatccagt 


gcatgcaggg 


cctattgcac 


caggccagat 


gagagaacca 


780 




aggggaagtg 


acatagcagg 


aactactagt 


acccttcagg 


aacaaatagg 


atggatgaca 


840 




aataatccac 


ctatcccagt 


aggagaaatt 


tataaaagat 


ggataatcct gggattaaat 


900 




aaaatagtaa 


gaatgtatag 


ccctaccagc 


attctggaca 


taagacaagg accaaaagaa 


960 




ccttttagag 


actatgtaga 


ccggttctat 


aaaactctaa 


gagccgagca 


agcttcacag 


1020 




gaggtaaaaa 


attggatgac 


agaaaccttg 


ttggtccaaa 


atgcgaaccc 


agattgtaag 


1080 




actattttaa 


aagcattggg 


accagcggct 


acactagaag 


aaatgatgac 


agcatgtcag 


1140 




ggagtaggag 


gacccggcca 


taaggcaaga 


gttttgttat 


tcataatgat 


agtaggaggc 


1200 



ttggtaggtt taagaatagt ttttgctgta ctttctgtag tgaatagagt taggcaggga 
tattcaccat tatcgtttca gacccacctc ccaatcccga ggggataa 



<210> 37 
<211> 363 
<212> PRT 

<213> Human immunodeficiency virus type 1 
<400> 37 

Met Gly Ala Arg Ala Ser Val Leu Ser Gly Gly Glu Leu Asp Arg Trp 
15 10 15 



Glu Lys lie Arg Leu Arg Pro Gly Gly Lys Lys Lys Tyr Lys Leu Lys 
20 25 30 



His lie Val Trp Ala Ser Arg Glu Leu Glu Arg Phe Ala Val Asn Pro 
35 40 45 



Gly Leu Leu Glu Thr Ser Glu Gly Cys Arg Gin He Leu Gly Gin Leu 
50 55 60 



Gin Pro Ser Leu Gin Thr Gly Ser Glu Glu Leu Arg Ser Leu Tyr Asn 
65 70 75 80 



Thr Val Ala Thr Leu Tyr Cys Val His Gin Arg He Glu He Lys Asp 
85 90 95 



Thr Lys Glu Ala Leu Asp Lys He Glu Glu Glu Gin Asn Lys Ser Lys 
100 105 HO 



Lys Lys Ala Gin Gin Ala Ala Ala Asp Thr Gly His Ser Ser Gin Val 
115 120 125 



Ser Gin Asn Tyr Pro He Val Gin Asn He Gin Gly Gin Met Val His 
130 135 140 



Gin Ala He Ser Pro Arg Thr Leu Asn Ala Trp Val Lys Val Val Glu 
145 150 155 160 



Glu Lys Ala Phe Ser Pro Glu Val He Pro Met Phe Ser Ala Leu Ser 
165 170 175 



Glu Gly Ala Thr Pro Gin Asp Leu Asn Thr Met Leu Asn Thr Val Gly 
180 185 190 



Gly His Gin Ala Ala Met Gin Met Leu Lys Glu Thr He Asn Glu Glu 



195 



200 



205 



Ala Ala Glu Trp Asp Arg Val His Pro Val His Ala Gly Pro lie Ala 
210 215 220 



Pro Gly Gin Met Arg Glu Pro Arg Gly Ser Asp lie Ala Gly Thr Thr 
225 230 235 240 



Ser Thr Leu Gin Glu Gin lie Gly Trp Met Thr Asn Asn Pro Pro lie 
245 250 255 



Pro Val Gly Glu lie Tyr Lys Arg Trp lie lie Leu Gly Leu Asn Lys 
260 265 270 



lie Val Arg Met Tyr Ser Pro Thr Ser lie Leu Asp lie Arg Gin Gly 
275 280 285 



Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg Phe Tyr Lys Thr Leu 
290 295 300 



Arg Ala Glu Gin Ala Ser Gin Glu Val Lys Asn Trp Met Thr Glu Thr 
305 310 315 320 



Leu Leu Val Gin Asn Ala Asn Pro Asp Cys Lys Thr lie Leu Lys Ala 
325 330 335 



Leu Gly Pro Ala Ala Thr Leu Glu Glu Met Met Thr Ala Cys Gin Gly 
340 345 350 



Val Gly Gly Pro Gly His Lys Ala Arg Val Leu 
355 360 



<210> 38 
<211> 410 
<212> PRT 

<213> Human immunodeficiency virus type 1 
<400> 38 

Met Arg Val Lys Glu Lys Tyr Gin His Leu Trp Arg Trp Gly Trp Arg 
15 10 15 



Trp Gly Thr Met Leu Leu Gly Met Leu Met lie Cys Ser Ala Gly Ala 
20 25 30 



Arg Ala Ser Val Leu Ser Gly Gly Glu Leu Asp Arg Trp Glu Lys lie 
35 40 45 



Arg Leu Arg Pro Gly Gly Lys Lys Lys Tyr Lys Leu Lys His IleVal 
50 55 60 



Trp Ala Ser Arg Glu Leu Glu Arg Phe Ala Val Asn Pro Gly Leu Leu 
65 70 75 80 



Glu Thr Ser Glu Gly Cys Arg Gin lie Leu Gly Gin Leu Gin Pro Ser 
85 90 95 



Leu Gin Thr Gly Ser Glu Glu Leu Arg Ser Leu Tyr Asn Thr Val Ala 
100 105 110 



Thr Leu Tyr Cys Val His Gin Arg He Glu He Lys Asp Thr Lys Glu 
115 12 0 125 



Ala Leu Asp Lys He Glu Glu Glu Gin Asn Lys Ser Lys Lys Lys Ala 
130 135 140 



Gin Gin Ala Ala Ala Asp Thr Gly His Ser Ser Gin Val Ser Gin Asn 
145 150 155 160 



Tyr Pro Gin Gin Ala Ala Ala Asp Thr Gly His Ser Ser Gin Val Ser 
165 170 175 



Gin Asn Tyr Pro Gin Gin Ala Ala Ala Asp Thr Gly His Ser Ser Gin 
180 185 190 



Val Ser Gin Asn Tyr Pro He Val Gin Asn He Gin Gly Gin Met Val 
195 ~ 200 205 



His Gin Ala He Ser Pro Arg Thr Leu Asn Ala Trp Val Lys Val Val 
210 215 220 



Glu Glu Lys Ala Phe Ser Pro Glu Val lie Pro Met Phe Ser Ala Leu 
225 230 235 240 



Ser Glu Gly Ala Thr Pro Gin Asp Leu Asn Thr Met Leu Asn Thr Val 
245 250 255 



Gly Gly His Gin Ala Ala Met Gin Met Leu Lys Glu Thr He Asn Glu 
260 265 270 



Glu Ala Ala Glu Trp Asp Arg Val His Pro Val His Ala Gly Pro He 
275 280 285 



Ala Pro Gly Gin Met Arg Glu Pro Arg Gly Ser Asp He Ala Gly Thr 
290 295 300 



Thr Ser Thr Leu Gin Glu Gin He Gly Trp Met Thr Asn Asn Pro Pro 
305 310 315 320 



He Pro Val Gly Glu He Tyr Lys Arg Trp He He Leu Gly Leu Asn 
325 ' 330 335 



Lys He Val Arg Met Tyr Ser Pro Thr Ser He Leu Asp He Arg Gin 
340 345 350 



Gly Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg Phe Tyr Lys Thr 
355 360 365 



Leu Arg Ala Glu Gin Ala Ser Gin Glu Val Thr He Leu Lys Ala Leu 
370 375 380 



Gly Pro Ala Ala Thr Leu Glu Glu Met Met Thr Ala Cys Gin Gly Val 
385 390 395 400 



Gly Gly Pro Gly His Lys Ala Arg Val Leu 
405 410 



<210> 39 
<211> 453 
<212> PRT 

<213> Human immunodeficiency virus type 1 
<400> 39 

Met Arg Val Lys Glu Lys Tyr Gin His Leu Trp Arg Trp Gly Trp Arg 
1 ~ 5 10 15 



Trp Gly Thr Met Leu Leu Gly Met Leu Met He Cys Ser Ala Gly Ala 
20 25 30 



Arg Ala Ser Val Leu Ser Gly Gly Glu Leu Asp Arg Trp Glu Lys He 
35 40 45 



Arg Leu Arg Pro Gly Gly Leu Ser Gly Gly Glu Leu Asp Arg Trp Glu 
50 55 60 



Lys He Arg Leu Arg Pro Gly Gly Lys Lys Lys Tyr Lys Leu Lys His 
65 " 70 } 75 80 



He Val Trp Ala Ser Arg Glu Leu Glu Arg Phe Ala Val Asn Pro Gly 
85 90 95 



Leu Leu Glu Thr Ser Glu Gly Cys Arg Gin lie Leu Gly Gin Leu Gin 
100 105 110 



Pro Ser Leu Gin Thr Gly Ser Glu Glu Leu Arg Ser Leu Tyr Asn Thr 
115 120 125 



Val Ala Thr Leu Tyr Cys Val His Gin Arg lie Glu lie Lys Asp Thr 
130 135 140 



Lys Glu Ala Leu Asp Lys lie Glu Glu Glu Gin Asn Lys Ser Lys Lys 
145 ~ 150 155 160 



Lys Ala Gin Gin Ala Ala Ala Asp Thr Gly His Ser Ser Gin Val Ser 
165 170 175 



Gin Asn Tyr Pro He Val Gin Asn He Gin Gly Gin Met Val His Gin 
180 185 190 



Ala He Ser Pro Arg Thr Leu Asn Ala Trp Val Lys Val Val Glu Glu 
195 200 205 



Lys Ala Phe Ser Pro Glu Val He Pro Met Phe Ser Ala Leu Ser Glu 
210 215 220 



Gly Ala Thr Pro Gin Asp Leu Asn Thr Met Leu Asn Thr Val Gly Gly 
225 230 235 240 



His Gin Ala Ala Met Gin Met Leu Lys Glu Thr He Asn Glu Glu Ala 
245 250 255 



Ala Glu Trp Asp Arg Val His Pro Val His Ala Gly Pro He Ala Pro 
260 265 270 



Gly Gin Met Arg Glu Pro Arg Gly Ser Asp He Ala Gly Thr Thr Ser 
275 280 285 



Thr Leu Gin Glu Gin lie Gly Trp Met Thr Asn Asn Pro Pro He Pro 
290 295 300 



Val Gly Glu He Tyr Lys Arg Trp lie He Leu Gly Leu Asn Lys He 
305 ~ ' 310 ~ 315 " 320 



Val Arg Met Tyr Ser Pro Thr Ser He Leu Asp lie Arg Gin Gly Pro 
325 330 335 



Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg Phe Tyr Lys Thr Leu Arg 
340 " 345 350 



Ala Glu Gin Ala Ser Gin Glu Val Lys Asn Trp Met Thr Glu Thr Leu 
355 360 365 



Leu Val Gin Asn Ala Asn Pro Asp Cys Lys Thr lie Leu Lys Ala Leu 
370 375 380 



Gly Pro Ala Ala Thr Leu Glu Glu Met Met Thr Ala Cys Gin Gly Val 
385 390 395 400 



Gly Gly Pro Gly His Lys Ala Arg Val Leu Leu Phe He Met He Val 
405 410 415 



Gly Gly Leu Val Gly Leu Arg He Val Phe Ala Val Leu Ser Val Val 
420 425 430 



q Asn Arg Val Arg Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr His Leu 

5? 435 " 440 445 

%J 

z z 
; s * 
V* 

Q Pro He Pro Arg Gly 

kj 450 

in 

f R <210> 40 

f~ <211> 399 

H 5 <212> DNA 

P <213> Human immunodeficiency virus type 1 



a - 



m <400> 40 



atgggtgcga 


gagcgtcagt 


attaagcggg 


ggagaattag 


atcgatggga 


aaaaattcgg 


60 


ttaaggccag 


ggggaaagaa 


aaaatataaa 


ttaaaacata 


tagtatgggc 


aagcagggag 


120 


ctagaacgat 


tcgcagttaa 


tcctggcctg 


ttagaaacat 


cagaaggctg tagacaaata 


180 


ctgggacagc 


tacaaccatc 


ccttcagaca 


ggatcagaag 


aacttagatc 


attatataat 


240 


acagtagcaa 


ccctctattg 


tgtgcatcaa 


aggatagaga 


taaaagacac 


caaggaagct 


300 


ttagacaaga 


tagaggaaga 


gcaaaacaaa 


agtaagaaaa 


aagcacagca 


agcagcagct 


360 


gacacaggac 


acagcagtca 


ggtcagccaa 


aattactaa 






399 



<210> 41 

<211> 486 

<212> DNA 

<213> Human immunodeficiency virus type 1 



<400> 41 

atgagagtga aggagaaata tcagcacttg tggagatggg ggtggagatg gggcaccatg 60 





ctccttggga 


tgttgatgat 


ctgtagtgct ggtgcgagag 


cgtcagtatt 


aagcggggga 


120 




gaattagatc 


gatgggaaaa 


aattcggtta aggccagggg 


gaaagaaaaa 


atataaatta 


180 




aaacatatag 


tatgggcaag 


cagggagcta gaacgattcg cagttaatcc 


tggcctgtta 


240 




gaaacatcag 


aaggctgtag 


acaaatactg ggacagctac 


aaccatccct 


tcagacagga 


300 




tcagaagaac 


ttagatcatt 


atataataca gtagcaaccc 


tctattgtgt 


gcatcaaagg 


360 




atagagataa 


aagacaccaa 


ggaagcttta gacaagatag 


aggaagagca 


aaacaaaagt 


420 




aagaaaaaag 


cacagcaagc 


agcagctgac acaggacaca 


gcagtcaggt 


cagccaaaat 


480 




tactaa 










486 




<210> 42 
<211> 615 
<212> DNA 

<213> Human immunodeficiency virus type 1 








M. 

o 


<400> 42 
atgagagtga 


aggagaaata 


tcagcacttg tggagatggg 


ggtggagatg 


gggcaccatg 


60 


O 


ctccttggga 


tgttgatgat 


ctgtagtgct ggtgcgagag 


cgtcagtatt 


aagcggggga 


120 


hi 
□ 


gaattagatc 


gatgggaaaa 


aattcggtta aggccagggg 


gaaagaaaaa 


atataaatta 


180 


y s 


aaacatatag 


tatgggcaag 


cagggagcta gaacgattcg 


cagttaatcc 


tggcctgtta 


240 


gaaacatcag 


aaggctgtag 


acaaatactg ggacagctac 


aaccatccct 


tcagacagga 


300 


M 


tcagaagaac 


ttagatcatt 


atataataca gtagcaaccc 


tctattgtgt 


gcatcaaagg 


360 


£ 5_ 


atagagataa 


aagacaccaa 


ggaagcttta gacaagatag 


aggaagagca 


aaacaaaagt 


420 


: 

Q 


aagaaaaaag 


cacagcaagc 


agcagctgac acaggacaca 


gcagtcaggt 


cagccaaaat 


480 




tacttattca 


taatgatagt 


aggaggcttg gtaggtttaa 


gaatagtttt 


tgctgtactt 


540 




tctgtagtga 


atagagttag 


gcagggatat tcaccattat 


cgtttcagac 


ccacctccca 


600 




atcccgaggg 


gataa 








615 




<210> 43 
<211> 132 
<212> PRT 

<213> Human immunodeficiency virus type 1 










<400> 43 














Met Gly Ala Arg Ala Ser Val Leu Ser Gly Gly Glu Leu Asp Arg Trp 
15 10 15 






Glu Lys lie Arg Leu Arg Pro Gly Gly Lys Lys 
20 25 


Lys Tyr Lys Leu Lys 
30 





s S 

-SSB9 

u 



His lie Val Trp Ala Ser Arg Glu Leu Glu Arg Phe Ala Val Asn Pro 
35 40 45 



Gly Leu Leu Glu Thr Ser Glu Gly Cys Arg Gin lie Leu Gly Gin Leu 
50 55 60 



Gin Pro Ser Leu Gin Thr Gly Ser Glu Glu Leu Arg Ser Leu Tyr Asn 
65 70 " 75 80 



Thr Val Ala Thr Leu Tyr Cys Val His Gin Arg He Glu He Lys Asp 
85 90 95 



Thr Lys Glu Ala Leu Asp Lys He Glu Glu Glu Gin Asn Lys Ser Lys 
100 105 110 



Lys Lys Ala Gin Gin Ala Ala Ala Asp Thr Gly His Ser Ser Gin Val 
115 120 125 



Ser Gin Asn Tyr 
130 



<210> 44 

<211> 179 

<212> PRT 

<213> Human immunodeficiency virus type 1 



<400> 44 



t: 

P Met Arg Val Lys Glu Lys Tyr Gin His Leu Trp Arg Trp Gly Trp Arg 



15 10 15 



Trp Gly Thr Met Leu Leu Gly Met Leu Met He Cys Ser Ala Gly Ala 
20 25 30 



Arg Ala Ser Val Leu Ser Gly Gly Glu Leu Asp Arg Trp Glu Lys He 
35 40 45 



Arg Leu Arg Pro Gly Gly Lys Lys Lys Tyr Lys Leu Lys His He Val 
50 ~ 55 60 



Trp Ala Ser Arg Glu Leu Glu Arg Phe Ala Val Asn Pro Gly Leu Leu 
65 70 75 80 



Glu Thr Ser Glu Gly Cys Arg Gin He Leu Gly Gin Leu Gin Pro Ser 
85 90 95 



Leu Gin Thr Gly Ser Glu Glu Leu Arg Ser Leu Tyr Gly Gin Leu Gin 



100 



105 



110 



Pro Ser Leu Gin Thr Gly Ser Glu Glu Leu Arg Ser Leu Tyr Asn Thr 
115 120 125 



Val Ala Thr Leu Tyr Cys Val His Gin Arg lie Glu lie Lys Asp Thr 
13 0 135 14 0 



Lys Glu Ala Leu Asp Lys lie Glu Glu Glu Gin Asn Lys Ser Lys Lys 
145 ~ 150 155 160 



Lys Ala Gin Gin Ala Ala Ala Asp Thr Gly His Ser Ser Gin Val Ser 
165 170 175 



Gin Asn Tyr 



<210> 45 
<211> 186 
<212> PRT 

<213> Human immunodeficiency virus type 1 
<400> 45 

Met Arg Val Lys Glu Lys Tyr Gin His Leu Trp Arg Trp Gly Trp Arg 
15 10 15 



Trp Gly Thr Met Leu Leu Gly Met Leu Met lie Cys Ser Ala Gly Ala 
20 25 30 



Arg Ala Ser Val Leu Ser Gly Gly Glu Leu Asp Arg Trp Glu Lys lie 
35 40 45 



Arg Leu Arg Pro Gly Gly Lys Lys Lys Tyr Lys Leu Lys His lie Val 
50 ~ 55 60 



Trp Ala Ser Arg Glu Leu Glu Arg Gly Gin Leu Gin Pro Ser Leu Gin 
65 70 75 80 



Thr Gly Ser Glu Glu Leu Arg Ser Leu Tyr Asn Thr Val Ala Thr Leu 
85 90 95 



Tyr Cys Val His Gin Arg lie Glu lie Lys Asp Thr Lys Glu Ala Leu 
100 105 110 



Asp Lys lie Glu Glu Glu Gin Asn Lys Ser Lys Lys Lys Ala Gin Gin 
115 120 125 



in 

r 

Q 



Ala Ala Ala Asp Thr Gly His Ser Ser Gin Val Ser Gin Asn Tyr Leu 
130 135 140 



Phe He Met He Val Gly Gly Leu Val Gly Leu Arg He Val Phe Ala 
145 150 155 160 



Val Leu Ser Val Val Asn Arg Val Arg Gin Gly Tyr Ser Pro Leu Ser 
165 170 175 



Phe Gin Thr His Leu. Pro He Pro Arg Gly 
180 185 



<210> 46 
<211> 699 
<212> DNA 

<213> Human immunodeficiency virus type 1 








<400> 46 
atgcctatag 


tgcagaacat 


ccaggggcaa 


atggtacatc 


aggccatatc 


acctagaact 


60 


ttaaatgcat 


gggtaaaagt 


agtagaagag 


aaggctttca 


gcccagaagt 


aatacccatg 


120 


ttttcagcat 


tatcagaagg 


agccacccca 


caagatttaa 


acaccatgct 


aaacacagtg 


180 


gggggacatc 


aagcagccat 


gcaaatgtta 


aaagagacca 


tcaatgagga 


agctgcagaa 


240 


tgggatagag 


tacatccagt 


gcatgcaggg 


cctattgcac 


caggccagat 


gagagaacca 


300 


aggggaagtg 


acatagcagg 


aactactagt 


acccttcagg 


aacaaatagg 


atggatgaca 


360 


aataatccac 


ctatcccagt 


aggagaaatt 


tataaaagat 


ggataatcct 


gggattaaat 


420 


aaaatagtaa 


gaatgtatag 


ccctaccagc 


attctggaca 


taagacaagg 


accaaaagaa 


480 


ccttttagag 


actatgtaga 


ccggttctat 


aaaactctaa 


gagccgagca 


agcttcacag 


540 


gaggtaaaaa 


attggatgac 


agaaaccttg 


ttggtccaaa 


atgcgaaccc 


agattgtaag 


600 


actattttaa 


aagcattggg 


accagcggct 


acactagaag 


aaatgatgac 


agcatgtcag 


660 


ggagtaggag 


gacccggcca 


taaggcaaga 


gttttgtaa 






699 



<210> 47 
<211> 786 
<212> DNA 

<213> Human immunodeficiency virus type 1 
<400> 47 

atgagagtga aggagaaata tcagcacttg tggagatggg ggtggagatg gggcaccatg 60 
ctccttggga tgttgatgat ctgtagtgct cctatagtgc agaacatcca ggggcaaatg 12 0 
gtacatcagg ccatatcacc tagaacttta aatgcatggg taaaagtagt agaagagaag 180 



gctttcagcc cagaagtaat acccatgttt tcagcattat cagaaggagc caccccacaa 24 0 





gatttaaaca 


ccatgctaaa 


cacagtgggg 


ggacatcaag 


cagccatgca 


aatgttaaaa 


300 




gagaccatca 


atgaggaagc 


tgcagaatgg 


gatagagtac 


atccagtgca 


tgcagggcct 


360 




attgcaccag 


gccagatgag 


agaaccaagg 


ggaagtgaca 


tagcaggaac 


tactagtacc 


420 




cttcaggaac 


aaataggatg gatgacaaat 


aatccaccta 


tcccagtagg 


agaaatttat 


480 




aaaagatgga 


taatcctggg 


attaaataaa 


atagtaagaa 


tgtatagccc 


taccagcatt 


540 




ctggacataa 


gacaaggacc 


aaaagaacct 


tttagagact 


atgtagaccg 


gttctataaa 


600 




actctaagag 


ccgagcaagc 


ttcacaggag 


gtaaaaaatt 


ggatgacaga 


aaccttgttg 


660 




gtccaaaatg 


cgaacccaga 


ttgtaagact 


attttaaaag 


cattgggacc 


agcggctaca 


720 




ctagaagaaa 


tgatgacagc 


atgtcaggga 


gtaggaggac 


ccggccataa ggcaagagtt 


780 




ttgtaa 












786 


b 


<210> 48 
<211> 915 
<212> DNA 

<213> Human immunodeficiency virus type 1 








y 
a 

c „ e 

{ULE 

; f: 


<400> 48 
atgagagtga 


aggagaaata 


tcagcacttg 


tggagatggg 


ggtggagatg 


gggcaccatg 


60 


ctccttggga 


tgttgatgat 


ctgtagtgct 


cctatagtgc 


agaacatcca 


ggggcaaatg 


120 


S 


gtacatcagg 


ccatatcacc 


tagaacttta 


aatgcatggg 


taaaagtagt 


agaagagaag 


180 


{ . z 


gctttcagcc 


cagaagtaat 


acccatgttt 


tcagcattat 


cagaaggagc 


caccccacaa 


240 


o 


gatttaaaca 


ccatgctaaa 


cacagtgggg 


ggacatcaag 


cagccatgca 


aatgttaaaa 


300 


f — 

s : 


gagaccatca 


atgaggaagc 


tgcagaatgg 


gatagagtac 


atccagtgca 


tgcagggcct 


360 




attgcaccag 


gccagatgag 


agaaccaagg 


ggaagtgaca 


tagcaggaac 


tactagtacc 


420 




cttcaggaac 


aaataggatg 


gatgacaaat 


aatccaccta 


tcccagtagg 


agaaatttat 


480 




aaaagatgga 


taatcctggg 


attaaataaa 


atagtaagaa 


tgtatagccc 


taccagcatt 


540 




ctggacataa 


gacaaggacc 


aaaagaacct 


tttagagact 


atgtagaccg 


gttctataaa 


600 




actctaagag 


ccgagcaagc 


ttcacaggag 


gtaaaaaatt 


ggatgacaga 


aaccttgttg 


660 




gtccaaaatg 


cgaacccaga 


ttgtaagact 


attttaaaag 


cattgggacc 


agcggctaca 


720 




ctagaagaaa 


tgatgacagc 


atgtcaggga 


gtaggaggac 


ccggccataa 


ggcaagagtt 


780 




ttgttattca 


taatgatagt 


aggaggcttg 


gtaggtttaa gaatagtttt 


tgctgtactt 


840 




tctgtagtga 


atagagttag 


gcagggatat 


tcaccattat 


cgtttcagac 


ccacctccca 


900 




atcccgaggg 


gataa 










915 



<210> 49 
<211> 232 
<212> PRT 

<213> Human immunodeficiency virus type 1 
<400> 49 

Met Pro He Val Gin Asn He Gin Gly Gin Met Val His Gin Ala He 
15 10 15 



Ser Pro Arg Thr Leu Asn Ala Trp Val Lys Val Val Glu Glu Lys Ala 
20 25 30 



Phe Ser Pro Glu Val He Pro Met Phe Ser Ala Leu Ser Glu Gly Ala 
35 40 45 



Thr Pro Gin Asp Leu Asn Thr Met Leu Asn Thr Val Gly Gly His Gin 
50 55 60 



Ala Ala Met Gin Met Leu Lys Glu Thr He Asn Glu Glu Ala Ala Glu 
65 70 75 80 



Trp Asp Arg Val His Pro Val His Ala Gly Pro He Ala Pro Gly Gin 
85 90 95 



Met Arg Glu Pro Arg Gly Ser Asp He Ala Gly Thr Thr Ser Thr Leu 
100 105 HO 



Gin Glu Gin He Gly Trp Met Thr Asn Asn Pro Pro lie Pro Val Gly 
115 120 125 



Glu He Tyr Lys Arg Trp He He Leu Gly Leu Asn Lys He Val Arg 
130 ' ' ' 135 140 



Met Tyr Ser Pro Thr Ser He Leu Asp He Arg Gin Gly Pro Lys Glu 
145 150 155 160 



Pro Phe Arg Asp Tyr Val Asp Arg Phe Tyr Lys Thr Leu Arg Ala Glu 
165 170 175 



Gin Ala Ser Gin Glu Val Lys Asn Trp Met Thr Glu Thr Leu Leu Val 
180 185 ISO 



Gin Asn Ala Asn Pro Asp Cys Lys Thr He Leu Lys Ala Leu Gly Pro 
195 200 205 



Ala Ala Thr Leu Glu Glu Met Met Thr Ala Cys Gin Gly Val Gly Gly 
210 215 220 



Pro Gly His Lys Ala Arg Val Leu 
225 ' 230 



<210> 50 
<211> 261 
<212> PRT 

<213> Human immunodeficiency virus type 1 
<400> 50 

Met Arg Val Lys Glu Lys Tyr Gin His Leu Trp Arg Trp Gly Trp Arg 
15 10 15 



Trp Gly Thr Met Leu Leu Gly Met Leu Met lie Cys Ser Ala Pro lie 
20 25 30 



Val Gin Asn He Gin Gly Gin Met Val His Gin Ala He Ser Pro Arg 
35 40 45 



Thr Leu Asn Ala Trp Val Lys Val Val Glu Glu Lys Ala Phe Ser Pro 
50 55 60 



Glu Val He Pro Met Phe Ser Ala Leu Ser Glu Gly Ala Thr Pro Gin 
65 70 75 80 



Asp Leu Asn Thr Met Leu Asn Thr Val Gly Gly His Gin Ala Ala Met 
85 90 95 



Gin Met Leu Lys Glu Thr He Asn Glu Glu Ala Ala Glu Trp Asp Arg 
100 105 110 



Val His Pro Val His Ala Gly Pro He Ala Pro Gly Gin Met Arg Glu 
115 120 125 



Pro Arg Gly Ser Asp He Ala Gly Thr Thr Ser Thr Leu Gin Glu Gin 
130 ' " 135 140 



He Gly Trp Met Thr Asn Asn Pro Pro He Pro Val Gly Glu He Tyr 
145 150 155 160 



Lys Arg Trp He He Leu Gly Leu Asn Lys He Val Arg Met Tyr Ser 
165 170 175 



Pro Thr Ser He Leu Asp He Arg Gin Gly Pro Lys Glu Pro Phe Arg 
180 185 190 



Asp Tyr Val Asp Arg Phe Tyr Lys Thr Leu Arg Ala Glu Gin Ala Ser 
195 200 205 



Gin Glu Val Lys Asn Trp Met Thr Glu Thr Leu Leu Val Gin Asn Ala 
210 215 220 



Asn Pro Asp Cys Lys Thr He Leu Lys Ala Leu Gly Pro Ala Ala Thr 
225 * 230 235 240 



Leu Glu Glu Met Met Thr Ala Cys Gin Gly Val Gly Gly Pro Gly His 
245 250 255 



Lys Ala Arg Val Leu 
260 



<210> 51 
<211> 286 
<212> PRT 

<213> Human immunodeficiency virus type 1 
<400> 51 

Met Arg Val Lys Glu Lys Tyr Gin His Leu Trp Arg Trp Gly Trp Arg 
15 10 15 



Trp Gly Thr Met Leu Leu Gly Met Leu Met He Cys Ser Ala Pro He 
20 25 30 



Val Gin Asn He Gin Gly Gin Met Val His Gin Ala He Ser Pro Arg 
35 40 45 



Thr Leu Asn Ala Trp Val Lys Val Val Glu Glu Lys Ala Phe Ser Pro 
50 55 60 



Glu Val He Pro Met Phe Ser Ala Leu Ser Glu Gly Ala Thr Pro Gin 
65 70 75 80 



Asp Leu Asn Thr Met Leu Asn Thr Val Gly Gly His Gin Ala Ala Met 
85 90 95 



Gin Met Leu Lys Glu Thr He Asn Glu Glu Ala Ala Arg Glu Pro Arg 
100 105 HO 



Gly Ser Asp He Ala Gly Thr Thr Ser Thr Leu Gin Glu Gin He Gly 
115 120 125 



Trp Met Thr Asn Asn Pro Pro He Pro Val Gly Glu He Tyr Lys Arg 
130 135 140 



a 
Q 



Trp He He Leu Gly Leu Asn Lys He Val Arg Met Tyr Ser Pro Thr 
145 150 155 160 



Ser He Leu Asp He Arg Gin Gly Pro Lys Glu Pro Phe Arg Asp Tyr 
165 ~ 170 175 



Val Asp Arg Phe Tyr Lys Thr Leu Arg Ala Glu Gin Ala Ser Gin Glu 
180 185 190 



Val Lys Asn Trp Met Thr Glu Thr Leu Leu Val Gin Asn Ala Asn Pro 
195 200 205 



Asp Cys Lys Thr He Leu Lys Ala Leu Gly Pro Ala Ala Thr Leu Glu 
210 215 220 



Glu Met Met Thr Ala Cys Gin Gly Val Gly Gly Pro Gly His Lys Ala 
225 230 235 240 



Arg Val Leu Leu Phe He Met lie Val Gly Gly Leu Val Gly Leu Arg 
245 250 255 



y He Val Phe Ala Val Leu Ser Val Val Asn Arg Val Arg Gin Gly Tyr 

m 260 265 270 



Ser Pro Leu Ser Phe Gin Thr His Leu Pro He Pro Arg Gly 
275 280 285 



g <210> 52 

fT <211> 3839 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Modified Env/Tat 
<400> 52 

gaattctgca acaactgctg tttatccatt ttcagaattg ggtgtcgaca tagcagaata 60. 
ggcgttactc gacagaggag agcaagaaat ggagccagta gatcctagac tagagccctg 12 0 
gaagcatcca ggaagtcagc ctaaaactgc ttgtaccaat tgctattgta aaaagtgttg 18 0 
ctttcattgc caagtttgtt tcataacaaa agccttaggc atctcctatg gcaggaagaa 240 
gcggagacag cgacgaagac ctcctcaagg cagtcagact catcaagttt ctctatcaaa 300 
gcagtaagta gtacatgtaa tgcaacctat acaaatagca atagtagcat tagtagtagc 360 
aataataata gcaatagttg tgtggtccat agtaatcata gaatatagga aaatattaag 42 0 



y 

p 

m 

s 

r 

0 
u 



acaaacraaaa 


atagacaggt 


taattgatag 


actaatagaa 


agagcagaag 


acagtggcaa 


480 


taaaacrtcraa 


ggagaaatat 


cagcacttgt 


ggaqatqqqq 

33 3 3333 


gtggagatgg 


ggcaccatgc 


540 


tec ttgggat 


gttgatgatc 


tgtagtgcta 


cagaaaaatt 


gt999tcaca 
_j j — ' 


gtctattatg 


600 


qqqtacctqt 


qtqqaaqqaa 


gcaaccacca 


ctctattttg 


tgcatcagat 


gctaaagcat 


660 


a t era t acacia 


ggtacataat 


atttaaacca 


cacatgcctg 


tgtacccaca 


gaccccaacc 


720 


raraaaaaat 


aatattaata 


aatgtgacag 


aaaattttaa 


catgtggaaa 


aatgacatgg 


780 


taaaacaaat 


acatcraaaat 


ataatcagtt 


tatgggatca 


aagcctaaag 


ccatgtgtaa 


840 


aattaacccc 


actctgtgtt 


aqaQCtqcrta 


gttgtaacac 


ctcagtcatt 


acacaggcct 


900 


gtccaaaggt 


atcctttgag 


ccaattccca 


tacattattg 


tgccccggct 


ggttttgcga 


960 


ttctaaaatg 


taataataag 


acgttcaatg 


gaacaggacc 


atgtacaaat 


gtcagcacag 


1020 


tacaatotac 


acatggaatt 


aqcrccaqtaq 


tatcaactca 


actgctgtta 


aatggcagtc 


1080 


tacrcaaaaaa 


agaggtagta 


attagatctg 


ccaatttcac 


agacaatget 


aaaaccataa 


1140 


t aatacaQC t 


gaaccaatct 


gtagaaatta 


attgtacaag 


acccaacaac 


aatacaagaa 


1200 


aaaatatcca 


tatccagaga 


qqaccaqqqa 


qaqcatttqt 


tacaatagga 


aaaataggaa 


1260 


atatgagaca 


agcacattgt 


ctcqqqtqta 


ccagacctaa 


caacaataca 


agaaaaagtg 


1320 


tacatataaa 


accaggacaa 


acat tctatg 


caacaggtga 


tataataggg 


gatataagac 


1380 


aaaracatta 


ttataccracra 


cccaacaata 


atacaagaaa 


aagtataagg 


ataggaccag 


1440 


aacaaacatt 


etatgeaaca 


ggagaaataa 


taggagatat 


aagacaagca 


cattgttgca 


1500 


raaaarccta 

v«» C4. qa V—» 


caacaatata 


agacaaagga 


cccccatagg 


actagggcaa 


gcactctata 


1560 


caacaaoaaa 


aatagaagat 


ataagaagag 


cacattgttg 


taccagaccc 


tccaccaata 


1620 


caagaacaag 


tataegtata 


qqaccaqqac 


aagtattcta 


tagaacagga 


gacataacag 


1680 


gagatataag 


aaaagcatat 


tgtggatcct 


gtacaagacc 


caacaacaat 


acaagaaaaa 


1740 


aaatatcttt 


aaaaccaaaa 

w*. ^-j fct> v-» w*> w*. 


cgagtatttt 


atacagcagg 


agaaataata 


ggagacatca 


1800 


era a a acre a c a 


ttattatacc 


agacctaata 


acaatacaag 


aaaaagtata 


acttttgeae 


1860 




actctataca 


acaoatqaaa 


taataggaga 


tataagacaa 


gcacattgtc 


1920 


t r'crcicjt" a 1" a c. 


caaacctaac 


aacaatacaa 


gaaaaagtgt 


aegtatagga 


ccaggacaaa 


1980 


r*at~tpfcatac 


aacaaataat 


ataatagggg 


atataagaca 


agcacattgt 


tgtacgagac 


2040 


ccaacaataa 


tacaagaaaa 


agtataagga 


taggaccagg 


acaagcattc 


tatgeaacag 


2100 


gagaaataat 


aggagatata 


agacaagcac 


attgttgcac 


aaggccctac 


aacaatataa 


2160 


gacaaaggac 


ccccatagga 


ctagggcaag 


cactctatac 


aacaagaaga 


atagaagata 


2220 


taagaagagc 


acattgttgt 


accagaccct 


ccaccaatac 


aagaacaagt 


ataegtatag 


2280 



o 
w 

LJ 

: : 

u 



gaccaggaca 


agtattctat 


agaacaggag 


acataacagg 


agatataaga 


aaagcatatt 


2340 


gtggatcctg 


tacaagaccc 


aacaacaata 


caagaaaaag 


aatatcttta 


ggaccaggac 


2400 


gagtatttta 


tacagcagga 


gaaataatag 


gagacatcag 


aaaggcacat 


tgttgtacca 


2460 


gacctaataa 


caatacaaga 


aaaagtataa 


cttttgcacc 


aggacaagcg 


ctctatgcaa 


2520 


caggtgaaat 


aataggagat 


ataagacaag 


cacattgtct 


cgggaacatt 


agtagagcaa 


2580 


aatggaataa 


cactttaaaa 


cagatagata 


gcaaattaag 


agaacaattt 


ggaaataata 


2640 


aaacaataat 


ctttaagcag 


tcctcaggag 


gggacccaga 


aattgtaacg 


cacagtttta 


2700 


attgtggagg 


ggaatttttc 


tactgtaatt 


caacacaact 


gtttaatagt 


acttggttta 


2760 


atagtacttg 


gagtactaaa 


gggtcaaata 


acactgaagg 


aagtgacaca 


atcaccctcc 


2820 


catgcagaat 


aaaacaaatt 


ataaacatgt 


ggcaggaagt 


aggaaaagca 


atgtatgccc 


2880 


ctcccatcag 


tggacaaatt 


agatgttcat 


caaatattac 


agggctgcta 


ttaacaagag 


2940 


atggtggtaa 


tagcaacaat 


gagtccgaga 


tcttcagacc 


tggaggagga 


gatatgaggg 


3000 


acaattggag 


aagtgaatta 


tataaatata 


aagtagtaaa 


aattgaacca 


ttaggagtag 


3060 


cacccaccaa 


ggcaaagaga 


agagtggtgc 


agactagtgc 


agtgggaata 


ggagctttgt 


3120 


tccttgggtt 


cttgggagca 


gcaggaagca 


ctatgggcgc 


agcgtcaatg 


acgctgacgg 


3180 


tacaggccag 


acaattattg 


tctggtatag 


tgcagcagca 


gaacaatttg 


ctgagggcta 


3240 


ttgaggcgca 


acagcatctg 


ttgcaactca 


cagtctgggg 


catcaagcag 


ctccaggcaa 


3300 


gaatcctggc 


tgtggaaaga 


tacctaaagg 


atcaacagct 


cctggggatt 


tggggttgct 


3360 


ctggaaaact 


catttgcacc 


actgctgtgc 


cttggaatgc 


tagttggagt 


aataaatctc 


3420 


tggaacagat 


ttggaataac 


atgacctgga 


tggagtggga 


cagagaaatt 


aacaattaca 


3480 


caagcttaat 


acactcctta 


attgaagaat 


cgcaaaacca 


gcaagaaaag 


aatgaacaag 


3540 


aattattgga 


attagataaa 


tgggcaagtt 


tgtggaattg 


gtttaacata 


acaaattggc 


3600 


tgtggtatat 


aaaattattc 


ataatgatag 


taggaggctt 


ggtaggttta 


agaatagttt 


3660 


ttgctgtact 


ttctgtagtg 


aatagagtta 


ggcagggata 


ttcaccatta 


tcgtttcaga 


3720 


cccacctccc 


aatcccgagg 


ggacccgaca 


ggcccgaagg 


aatagaagaa 


gaaggtggag 


3780 


agagagacag 


agacagatcc 


attcgattag 


tgaacggatc 


cttagcactt 


atctggtaa 


3839 



<210> 53 

<211> 1101 

<212> PRT 

<213> Artificial sequence 



<220> 

<223> Modified Env/Tat 



<400> 53 



Met Arg Val Lys Glu Lys Tyr Gin His Leu Trp Arg Trp Gly Trp Arg 
1 5 10 15 



Trp Gly Thr Met Leu Leu Gly Met Leu Met He Cys Ser Ala Thr Glu 
20 25 30 



Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala 
35 40 45 



Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu 
50 55 60 



Val His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn 
65 70 75 80 



Pro Gin Glu Val Val Leu Val Asn Val Thr Glu Asn Phe Asn Met Trp 
85 90 95 



y 
S 

» 

o 



Lys Asn Asp Met Val Glu Gin Met His Glu Asp He He Ser Leu Trp 
100 105 HO 



Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Gly 
115 120 125 



Ala Gly Ser Cys Asn Thr Ser Val He Thr Gin Ala Cys Pro Lys Val 
130 135 140 



Ser Phe Glu Pro He Pro He His Tyr Cys Ala Pro Ala Gly Phe Ala 
145 150 155 160 



He Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys Thr 
165 170 175 



Asn Val Ser Thr Val Gin Cys Thr His Gly He Arg Pro Val Val Ser 
180 185 190 



Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val He 
195 200 205 



Arg Ser Ala Asn Phe Thr Asp Asn Ala Lys Thr He He Val Gin Leu 
210 215 220 



Asn Gin Ser Val Glu He Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg 
225 230 235 240 



Lys Ser lie Arg He Gin Arg Gly Pro Gly Arg Ala Phe Val Thr He 
245 250 255 



Gly Lys He Gly Asn Met Arg Gin Ala His Cys Leu Gly Cys Thr Arg 
260 265 270 



Pro Asn Asn Asn Thr Arg Lys Ser Val Arg He Gly Pro Gly Gin Thr 
275 280 285 



Phe Tyr Ala Thr Gly Asp He He Gly Asp He Arg Gin Ala His Cys 
290 295 300 



Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser He Arg He Gly Pro 
305 310 315 320 



Gly Gin Ala Phe Tyr Ala Thr Gly Glu He He Gly Asp He Arg Gin 
325 330 335 



Ala His Cys Cys Thr Arg Pro Tyr Asn Asn He Arg Gin Arg Thr Pro 
340 345 350 



He Gly Leu Gly Gin Ala Leu Tyr Thr Thr Arg Arg He Glu Asp He 
355 360 365 



Arg Arg Ala His Cys Cys Thr Arg Pro Ser Thr Asn Thr Arg Thr Ser 
370 375 380 



He Arg He Gly Pro Gly Gin Val Phe Tyr Arg Thr Gly Asp He Thr 
385 390 395 400 



Gly Asp He Arg Lys Ala Tyr Cys Gly Ser Cys Thr Arg Pro Asn Asn 
405 410 415 



Asn Thr Arg Lys Arg He Ser Leu Gly Pro Gly Arg Val Phe Tyr Thr 
420 425 430 



Ala Gly Glu lie He Gly Asp He Arg Lys Ala His Cys Cys Thr Arg 
435 440 445 



Pro Asn Asn Asn Thr Arg Lys Ser He Thr Phe Ala Pro Gly Gin Ala 
450 455 460 



Leu Tyr Ala Thr Gly Glu He He Gly Asp He Arg Gin Ala His Cys 
465 470 475 480 



Leu Gly Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Val Arg lie 
485 490 495 



Gly Pro Gly Gin Thr Phe Tyr Ala Thr Gly Asp lie lie Gly Asp lie 
500 505 510 



Arg Gin Ala His Cys Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser 
515 ^ 520 525 



He Arg He Gly Pro Gly Gin Ala Phe Tyr Ala Thr Gly Glu He He 
530 " 535 540 



Gly Asp He Arg Gin Ala His Cys Cys Thr Arg Pro Tyr Asn Asn He 
545 550 555 560 



Arg Gin Arg Thr Pro He Gly Leu Gly Gin Ala Leu Tyr Thr Thr Arg 
565 570 575 



Arg He Glu Asp He Arg Arg Ala His Cys Cys Thr Arg Pro Ser Thr 
580 585 590 



Asn Thr Arg Thr Ser He Arg He Gly Pro Gly Gin Val Phe Tyr Arg 
595 600 605 



Thr Gly Asp He Thr Gly Asp He Arg Lys Ala Tyr Cys Gly Ser Cys 
610 615 620 



Thr Arg Pro Asn Asn Asn Thr Arg Lys Arg He Ser Leu Gly Pro Gly 
625 630 635 640 



Arg Val Phe Tyr Thr Ala Gly Glu He He Gly Asp He Arg Lys Ala 
645 650 655 



His Cys Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser He Thr Phe 
660 665 670 



Ala Pro Gly Gin Ala Leu Tyr Ala Thr Gly Glu He He Gly Asp He 
675 680 685 



Arg Gin Ala His Cys Leu Gly Asn He Ser Arg Ala Lys Trp Asn Asn 
690 695 • 700 



Thr Leu Lys Gin He Asp Ser Lys Leu Arg Glu Gin Phe Gly Asn Asn 
705 710 715 720 



Lys Thr lie He Phe Lys Gin Ser Ser Gly Gly Asp Pro Glu He Val 
725 ' 730 735 



Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr 
740 745 750 



Gin Leu Phe Asn Ser Thr Trp Phe Asn Ser Thr Trp Ser Thr Lys Gly 
755 760 765 



Ser Asn Asn Thr Glu Gly Ser Asp Thr He Thr Leu Pro Cys Arg He 
770 775 780 



Lys Gin He He Asn Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala 
785 790 795 800 



Pro Pro He Ser Gly Gin He Arg Cys Ser Ser Asn He Thr Gly Leu 
805 810 815 



Leu Leu Thr Arg Asp Gly Gly Asn Ser Asn Asn Glu Ser Glu He Phe 
820 825 830 



Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr 
835 840 845 



Lys Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Lys 
850 855 860 



Ala Lys Arg Arg Val Val Gin Thr Ser Ala Val Gly He Gly Ala Leu 
865 870 875 880 



Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser 
885 890 895 



Met Thr Leu Thr Val Gin Ala Arg Gin Leu Leu Ser Gly He Val Gin 
900 905 910 



Gin Gin Asn Asn Leu Leu Arg Ala He Glu Ala Gin Gin His Leu Leu 
915 920 925 



Gin Leu Thr Val Trp Gly He Lys Gin Leu Gin Ala Arg He Leu Ala 
930 935 940 



Val Glu Arg Tyr Leu Lys Asp Gin Gin Leu Leu Gly He Trp Gly Cys 
945 950 955 960 



Ser Gly Lys Leu He Cys Thr Thr Ala Val Pro Trp Asn Ala Ser Trp 



u 
a 



965 970 975 



Ser Asn Lys Ser Leu Glu Gin lie Trp Asn Asn Met Thr Trp Met Glu 
980 985 990 



Trp Asp Arg Glu lie Asn Ash Tyr Thr Ser Leu lie His Ser Leu lie 
995 1000 1005 



Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu 
1010 1015 1020 



Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Asn lie Thr 
1025 1030 1035 



Asn Trp Leu Trp Tyr lie Lys Leu Phe lie Met lie Val Gly Gly 
1040 1045 1050 



Leu Val Gly Leu Arg lie Val Phe Ala Thr His Leu Pro lie Pro 
Q 1055 1060 1065 



Arg Gly Pro Asp Arg Pro Glu Gly lie Glu Glu Glu Gly Gly Glu 
1070 1075 1080 



fj] Arg Asp Arg Asp Arg Ser lie Arg Leu Val Asn Gly Ser Leu Ala 

1085 1090 1095 



Leu lie Trp 
1100 



<210> 
<211> 
<212> 
<213> 
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<400> 54 
gaattctgca 


acaactgctg 


tttatccatt 


ttcagaattg 


ggtgtcgaca 


tagcagaata 


60 


ggcgttactc 


gacagaggag 


agcaagaaat 


ggagccagta 


gatcctagac 


tagagccctg 


120 


gaagcatcca 


ggaagtcagc 


ctaaaactgc 


ttgtaccaat 


tgctattgta 


aaaagtgttg 


180 


ctttcattgc 


caagtttgtt 


tcataacaaa 


agccttaggc 


atctcctatg 


gcaggaagaa 


240 


gcggagacag 


cgacgaagac 


ctcctcaagg 


cagtcagact 


catcaagttt 


ctctatcaaa 


300 


gcagtaagta 


gtacatgtaa 


tgcaacctat 


acaaatagca 


atagtagcat 


tagtagtagc 


360 


aataataata 


gcaatagttg 


tgtggtccat 


agtaatcata 


gaatatagga 


aaatattaag 


420 



: . : 

LR 

5 



p 



acaaagaaaa 


atagacaggt 


taattgatag 


actaatagaa 


agagcagaag 


acagtggcaa 


480 


tgagagtgaa 


ggagaaatat 


cagcacttgt 


ggagatgggg 


gtggagatgg 


ggcaccatgc 


540 


tccttgggat 


gttgatgatc 


tgtagtgcta 


cagaaaaatt 


gtgggtcaca 


gtctattatg 


600 


gggtacctgt 


gtggaaggaa 


gcaaccacca 


ctctattttg 


tgcatcagat 


gctaaagcat 


660 


atgatacaga 


ggtacataat 


gtttgggcca 


cacatgcctg 


tgtacccaca 


gaccccaacc 


720 


cacaagaagt 


agtattggta 


aatgtgacag 


aaaattttaa 


catgtggaaa 


aatgacatgg . 


780 


tagaacagat 


gcatgaggat 


ataatcagtt 


tatgggatca 


aagcctaaag 


ccatgtgtaa 


840 


aattaacccc 


actctgtgtt 


ggagctggta 


gttgtaacac 


ctcagtcatt 


acacaggcct 


900 


gtccaaaggt 


atcctttgag 


ccaattccca 


tacattattg 


tgccccggct 


ggttttgcga 


960 


ttctaaaatg 


taataataag 


acgttcaatg 


gaacaggacc 


atgtacaaat 


gtcagcacag 


1020 


tacaatgtac 


acatggaatt 


aggccagtag 


tatcaactca 


actgctgtta 


aatggcagtc 


1080 


tggcagaaga 


agaggtagta 


attagatctg 


ccaatttcac 


agacaatgct 


aaaaccataa 


1140 


tagtacagct 


gaaccaatct 


gtagaaatta 


attgtacaag 


acccaacaac 


aatacaagaa 


1200 


aaagtatccg 


tatccagaga 


ggaccaggga 


gagcatttgt 


tacaatagga 


aaaataggaa 


1260 


atatgagaca 


agcacattgt 


ctcgggtgta 


ccagacctaa 


caacaataca 


agaaaaagtg 


1320 


tacgtatagg 


accaggacaa 


acattctatg 


caacaggtga 


tataataggg 


gatataagac 


1380 


aagcacattg 


ttgtacgaga 


cccaacaata 


atacaagaaa 


aagtataagg 


ataggaccag 


1440 


gacaagcatt 


ctatgcaaca 


ggagaaataa 


taggagatat 


aagacaagca 


cattgttgca 


1500 


caaggcccta 


caacaatata 


agacaaagga 


cccccatagg 


actagggcaa 


gcactctata 


1560 


caacaagaag 


aatagaagat 


ataagaagag 


cacattgttg 


taccagaccc 


tccaccaata 


1620 


caagaacaag 


tatacgtata 


ggaccaggac 


aagtattcta 


tagaacagga 


gacataacag 


1680 


gagatataag 


aaaagcatat 


tgtggatcct 


gtacaagacc 


caacaacaat 


acaagaaaaa 


1740 


gaatatcttt 


aggaccagga 


cgagtatttt 


atacagcagg 


agaaataata 


ggagacatca 


1800 


gaaaggcaca 


ttgttgtacc 


agacctaata 


acaatacaag 


aaaaagtata 


acttttgcac 


1860 


caggacaagc 


gctctatgca 


acaggtgaaa 


taataggaga 


tataagacaa 


gcacattgtc 


1920 


tcgggtgtac 


cagacctaac 


aacaatacaa 


gaaaaagtgt 


acgtatagga 


ccaggacaaa 


1980 


cattctatgc 


aacaggtgat 


ataatagggg 


atataagaca 


agcacattgt 


tgtacgagac 


2040 


ccaacaataa 


tacaagaaaa 


agtataagga 


taggaccagg 


acaagcattc 


tatgcaacag 


2100 


gagaaataat 


aggagatata 


agacaagcac 


attgttgcac 


aaggccctac 


aacaatataa 


2160 


gacaaaggac 


ccccatagga 


ctagggcaag 


cactctatac 


aacaagaaga 


atagaagata 


2220 





taagaagagc 


acattgttgt 


accagaccct 


ccaccaatac 


aagaacaagt 


atacgtatag 


2280 




gaccaggaca 


agtattctat 


agaacaggag 


acataacagg 


agatataaga 


aaagcatatt 


2340 




qtqqatcctq 


tacaagaccc 


aacaacaata 


caagaaaaag 


aatatcttta 


ggaccaggac 


2400 




gagtatttta 


tacagcagga 


gaaataatag 


gagacatcag 


aaaggcacat 


tgttgtacca 


2460 




gacctaataa 


caatacaaga 


aaaagtataa 


cttttgcacc 


aggacaagcg 


ctctatgcaa 


2520 




caggtgaaat 


aataggagat 


ataagacaag 


cacattgtct 


cgggaacatt 


agtagagcaa 


2580 




aatggaataa 


cactttaaaa 


cagatagata 


gcaaattaag 


agaacaattt 


ggaaataata 


2640 




aaacaataat 


ctttaagcag 


tcctcaggag 


gggacccaga 

zj zj 


aattgtaacg 


cacagtttta 


2700 




attqtqqaqq 


ggaatttttc 


tactgtaatt 


caacacaact 


gtttaatagt 


acttggttta 


2760 




atagtacttg 


gagtactaaa 


gggtcaaata 


acactgaagg 


aagtgacaca 


atcaccctcc 


2820 




catgcagaat 


aaaacaaatt 


ataaacatgt 


ggcaggaagt 


aggaaaagca 


atgtatgccc 


2880 




ctcccatcag 


tggacaaatt 


agatgttcat 


caaatattac 


agggctgcta 


ttaacaagag 


2940 




atggtggtaa 


tagcaacaat 


gagtccgaga 


tcttcagacc 


tggaggagga 


gatatgaggg 


3000 


fad 


acaattggag 


aagtgaatta 


tataaatata 


aagtagtaaa 

ZJ ZJ 


aattgaacca 


ttaggagtag 


3060 


t . : 


cacccaccaa 


ggcaaagaga 


agagtggtgc 


agactagtgc 


agtgggaata 


ggagctttgt 


3120 


s 

6 


tccttgggtt 


cttgggagca 


gcaggaagca 


ctatgggctg 


cacgtcaatg 


acgctgacgg 


3180 


tacaggccag 


acaattattg 


tctgatatag 


tgcagcagca 


gaacaatttg 


ctgagggcta 


3240 


; - 
j— - 

s - 


ttqaqqcqca 


acagcatctg 


ttgcaactca 


cagtctgggg 


catcaaacag 


ctccaggcaa 


3300 


P 


gaatcctggc 


tqtqqaaaqa 


tacctaaagg 


atcaacagct 


cctggggatt 


tggggttgct 


3360 


O 


ctggaaaact 


catttgcacc 


actgctgtgc 


cttggaatgc 


tagttggagt 


aataaatctc 


3420 


tggaacagat 


ttggaataac 


atgacctgga 


tqgagtggga 

J J ZJ J mJ -J 


cagagaaatt 


aacaattaca 


3480 




caagcttaat 


acactcctta 


attgaagaat 


cgcaaaacca 


gcaagaaaag 


aatgaacaag 


3540 




aattattgga 


attagataaa 


tgggcaagtt 


tgtgqaattq 


gtttaacata 


acaaattggc 


3600 




tgtggtatat 


aaaattattc 


ataatgatag 


taggaggctt 


ggtaggttta 


agaatagttt 


3660 




ttgctgtact 


ttctatagtg 


aatagagtta 


ggcagggata 


ttcaccatta 


tcgtttcaga 


3720 




cccacctccc 


aatcccgagg 


qqacccqaca 


ggcccgaagg 

ZJ ZJ ZJ ZJ ZJ 


aatagaagaa 


gaaggtggag 


3780 




agagagacag 


agacagatcc 


attcgattag 


tgaacggatc 


cttagcactt 


atctgggacg 


3840 




atctgcggag 


cctgtgcctc 


ttcagctacc 


accgcttgag 


agacttactc 


utgatt-gtaa 






cgaggattgt 


ggaacttctg 


ggacgcaggg 


ggtgggaagc 


cctcaaatat 


tggtggaatc 


3960 




tcctacagta 


ttggagtcag 


gaactaaaga 


atagtgctgt 


taacttgctc 


aatgccacag 


4020 




ccatagcagt 


agctgagtaa 










4040 



<210> 55 
<211> 1186 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Modified Env/Tat/Rev 
<400> 55 

Met Arg Val Lys Glu Lys Tyr Gin His Leu Trp Arg Trp Gly Trp Arg 
15 10 15 



Trp Gly Thr Met Leu Leu Gly Met Leu Met lie Cys Ser Ala Thr Glu 
20 25 30 



Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala 
.35 40 45 



Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu 
50 55 60 



Val His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn 
65 " 70 75 80 



Pro Gin Glu Val Val Leu Val Asn Val Thr Glu Asn Phe Asn Met Trp 
85 90 95 



Lys Asn Asp Met Val Glu Gin Met His Glu Asp lie He Ser Leu Trp 
100 105 110 



Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Gly 
115 120 125 



Ala Gly Ser Cys Asn Thr Ser Val He Thr Gin Ala Cys Pro Lys Val 
130 135 140 



Ser Phe Glu Pro He Pro He His Tyr Cys Ala Pro Ala Gly Phe Ala 
145 150 155 160 



He Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys Thr 
165 170 175 



Asn Val Ser Thr Val Gin Cys Thr His Gly He Arg Pro Val Val Ser 
180 185 190 



Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val He 



195 



200 



205 



Arg Ser Ala Asn Phe Thr Asp Asn Ala Lys Thr He He Val Gin Leu 
210 215 22 0 



Asn Gin Ser Val Glu He Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg 
225 230 235 240 



Lys Ser He Arg He Gin Arg Gly Pro Gly Arg Ala Phe Val Thr He 
245 250 255 



Gly Lys He Gly Asn Met Arg Gin Ala His Cys Leu Gly Cys Thr Arg 
260 265 270 



Pro Asn Asn Asn Thr Arg Lys Ser Val Arg He Gly Pro Gly Gin Thr 
275 280 285 



Phe Tyr Ala Thr Gly Asp He He Gly Asp He Arg Gin Ala His Cys 
290 295 300 



Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser He Arg He Gly Pro 
305 ~ 310 315 320 



Gly Gin Ala Phe Tyr Ala Thr Gly Glu He He Gly Asp He Arg Gin 
325 330 335 



Ala His Cys Cys Thr Arg Pro Tyr Asn Asn He Arg Gin Arg Thr Pro 
340 345 350 



He Gly Leu Gly Gin Ala Leu Tyr Thr Thr Arg Arg He Glu Asp He 
355 360 365 



Arg Arg Ala His Cys Cys Thr Arg Pro Ser Thr Asn Thr Arg Thr Ser 
370 375 380 



He Arg He Gly Pro Gly Gin Val Phe Tyr Arg Thr Gly Asp He Thr 
385 390 395 400 



Gly Asp He Arg Lys Ala Tyr Cys Gly Ser Cys Thr Arg Pro Asn Asn 
405 410 415 



Asn Thr Arg Lys Arg He Ser Leu Gly Pro Gly Arg Val Phe Tyr Thr 
420 425 430 



Ala Gly Glu He He Gly Asp He Arg Lys Ala His Cys Cys Thr Arg 
435 440 445 



Pro Asn Asn Asn Thr Arg Lys Ser He Thr Phe Ala Pro Gly Gin Ala 
450 455 460 



Leu Tyr Ala Thr Gly Glu lie He Gly Asp He Arg Gin Ala His Cys 
465 470 475 480 



Leu Gly Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Val Arg He 
485 490 495 



Gly Pro Gly Gin Thr Phe Tyr Ala Thr Gly Asp He He Gly Asp He 
500 505 510 



Arg Gin Ala His Cys Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser 
515 ' 520 525 



He Arg He Gly Pro Gly Gin Ala Phe Tyr Ala Thr Gly Glu He lie 
530 ~ 535 540 



Gly Asp He Arg Gin Ala His Cys Cys Thr Arg Pro Tyr Asn Asn He 
545 550 555 560 



Arg Gin Arg Thr Pro He Gly Leu Gly Gin Ala Leu Tyr Thr Thr Arg 
565 570 575 



Arg He Glu Asp He Arg Arg Ala His Cys Cys Thr Arg Pro Ser Thr 
580 585 590 



Asn Thr Arg Thr Ser He Arg He Gly Pro Gly Gin Val Phe Tyr Arg 
595 600 605 



Thr Gly Asp He Thr Gly Asp He Arg Lys Ala Tyr Cys Gly Ser Cys 
610 615 620 . 



Thr Arg Pro Asn Asn Asn Thr Arg Lys Arg He Ser Leu Gly Pro Gly 
625 630 635 640 



Arg Val Phe Tyr Thr Ala Gly Glu He He Gly Asp He Arg Lys Ala 
645 650 655 



His Cys Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser He Thr Phe 
660 665 670 



Ala Pro Gly Gin Ala Leu Tyr Ala Thr Gly Glu He He Gly Asp He 
675 680 685 



Arg Gin Ala His Cys Leu Gly Asn He Ser Arg Ala Lys Trp Asn Asn 
690 695 700 



Thr Leu Lys Gin He Asp Ser Lys Leu Arg Glu Gin Phe Gly Asn Asn 
705 710 715 720 



Lys Thr He He Phe Lys Gin Ser Ser Gly Gly Asp Pro Glu He Val 
725 730 735 



Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr 
740 745 750 



Gin Leu Phe Asn Ser Thr Trp Phe Asn Ser Thr Trp Ser Thr Lys Gly 
755 760 765 



Ser Asn Asn Thr Glu Gly Ser Asp Thr He Thr Leu Pro Cys Arg He 
770 * 775 780 



Lys Gin He He Asn Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala 
785 790 795 800 



Pro Pro He Ser Gly Gin He Arg Cys Ser Ser Asn He Thr Gly Leu 
805 810 815 



Leu Leu Thr Arg Asp Gly Gly Asn Ser Asn Asn Glu Ser Glu He Phe 
820 * 825 830 



Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr 
835 840 845 



Lys Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Lys 
850 855 860 



Ala Lys Arg Arg Val Val Gin Thr Ser Ala Val Gly He Gly Ala Leu 
865 ~ 870 875 880 



Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Cys Thr Ser 
885 890 895 



Met Thr Leu Thr Val Gin Ala Arg Gin Leu Leu Ser Asp He Val Gin 
900 905 910 



Gin Gin Asn Asn Leu Leu Arg Ala He Glu Ala Gin Gin His Leu Leu 
915 920 925 



Gin Leu Thr Val Trp Gly lie Lys Gin Leu Gin Ala Arg lie Leu Ala 
930 ~ ~ 935 940 

Val Glu Arg Tyr Leu Lys Asp Gin Gin Leu Leu Gly He Trp Gly Cys 
945 950 955 960 

Ser Gly Lys Leu He Cys Thr Thr Ala Val Pro Trp Asn Ala Ser Trp 
965 " 970 975 

Ser Asn Lys Ser Leu Glu Gin He Trp Asn Asn Met Thr Trp Met Glu 
980 985 990 

Trp Asp Arg Glu He Asn Asn Tyr Thr Ser Leu He His Ser Leu He 
995 1000 1005 

Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu 
1010 1015 1020 

Q Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Asn He Thr 

O 1025 1030 1035 

□ 

E 7. 
Ill 

a 



Asn Trp Leu Trp Tyr He Lys Leu Phe He Met He Val Gly Gly 
1040 1045 1050 



Leu Val Gly Leu Arg He Val Phe Ala Val Leu Ser He Val Asn 
1055 1060 1065 



I 

s 

M 

P Arg Val Arg Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr His Leu 

M 1070 ~ 1075 1080 

Pro He Pro Arg Gly Pro Asp Arg Pro Glu Gly He Glu Glu Glu 
1085 1090 1095 

Gly Gly Glu Arg Asp Arg Asp Arg Ser He Arg Leu Val Asn Gly 
1100 ~ " 1105 1110 

Ser Leu Ala Leu He Trp Asp Asp Leu Arg Ser Leu Cys Leu Phe 
1115 1120 1125 

Ser Tyr His Arg Leu Arg Asp Leu Leu Leu He Val Thr Arg He 
1130 1135 1140 

Val Glu Leu Leu Gly Arg Arg Gly Trp Glu Ala Leu Lys Tyr Trp 
1145 1150 1155 



Trp Asn Leu Leu Gin Tyr Trp Ser Gin Glu Leu Lys Asn Ser Ala 



1160 



1165 



1170 



s . 5 



Val Asn Leu Leu Asn Ala Thr Ala He Ala Val Ala Glu 
1175 1180 1185 

/ 

<210> 56 
<211> 507 
<212> DNA 

<213> Human immunodeficiency virus type 1 
<400> 56 



atgttcttta 


gggaagatct 


ggccttccta 


caagggaagg 


ccagggaatt 


ttcttcagag 


60 


cagaccagag 


ccaacagccc 


caccatttct 


tcagagcaga 


ccagagccaa 


cagccccacc 


120 


agaagagagc 


ttcaggtctg 


gggtagagac 


aacaactccc 


cctcagaagc 


aggagccgat 


180 


agacaaggaa 


ctgtatcctt 


taacttccct 


cagatcactc 


tttggcaacg 


acccctcgtc 


240 


acaataaaga 


taggggggca 


actaaaggaa 


gctctattag 


atacaggagc 


agatgataca 


300 


gtattagaag 


aaatgagttt 


gccaggaaga 


tggaaaccaa 


aaatgatagg 


gggaattgga 


360 


ggttttatca 


aagtaagaca 


gtatgatcag 


atactcatag 


aaatctgtgg 


acataaagct 


420 


ataggtacag 


tattagtagg 


acctacacct 


gtcaacataa 


ttggaagaaa 


tctgttgact 


480 


cagattggtt 


gcactttaaa 


tttttaa 








507 



in 

2 <210> 57 

L±, <211> 168 

<212> PRT 

J! <213> Human immunodeficiency virus type 1 

0 

M <400> 57 

0 

Met Phe Phe Arg Glu Asp Leu Ala Phe Leu Gin Gly Lys Ala Arg Glu 
1 5 10 15 

Phe Ser Ser Glu Gin Thr Arg Ala Asn Ser Pro Thr He Ser Ser Glu 
20 25 30 

Gin Thr Arg Ala Asn Ser Pro Thr Arg Arg Glu Leu Gin Val Trp Gly 
35 40 45 

Arg Asp Asn Asn Ser Pro Ser Glu Ala Gly Ala Asp Arg Gin Gly Thr 
50 55 60 

Val Ser Phe Asn Phe Pro Gin He Thr Leu Trp Gin Arg Pro Leu Val 
65 70 75 80 

Thr He Lys He Gly Gly Gin Leu Lys Glu Ala Leu Leu Asp Thr Gly 
85 90 95 



Ala Asp Asp Thr Val Leu Glu Glu Met Ser Leu Pro Gly Arg Trp Lys 
100 105 110 



Pro Lys Met lie Gly Gly lie Gly Gly Phe lie Lys Val Arg Gin Tyr 
115 J 120 125 

Asp Gin He Leu He Glu He Cys Gly His Lys Ala He Gly Thr Val 
130 135 140 

Leu Val Gly Pro Thr Pro Val Asn He He Gly Arg Asn Leu Leu Thr 
145 " 150 155 160 

Gin He Gly Cys Thr Leu Asn Phe 
165 

<210> 58 
<211> 1800 
<212> DNA 

<213> Artificial sequence 



<220> 



<223> Gag- 


-PI 










<400> 58 
atgggtgcga 


gagcgtcagt 


attaagcggg 


ggagaattag 


atcgatggga 


aaaaattcgg 


ttaaggccag 


ggggaaagaa 


aaaatataaa 


ttaaaacata 


tagtatgggc 


aagcagggag 


ctagaacgat 


tcgcagttaa 


tcctggcctg 


ttagaaacat 


cagaaggctg 


tagacaaata 


ctgggacagc 


tacaaccatc 


ccttcagaca 


ggatcagaag 


aacttagatc 


attatataat 


acagtagcaa 


ccctctattg 


tgtgcatcaa 


aggatagaga 


taaaagacac 


caaggaagct 


ttagacaaga 


tagaggaaga 


gcaaaacaaa 


agtaagaaaa 


aagcacagca 


agcagcagct 


gacacaggac 


acagcagtca 


ggtcagccaa 


aattacccta 


tagtgcagaa 


catccagggg 


caaatggtac 


atcaggccat 


atcacctaga 


actttaaatg 


catgggtaaa 


agtagtagaa 


gagaaggctt 


tcagcccaga 


agtaataccc 


atgttttcag 


cattatcaga 


aggagccacc 


ccacaagatt 


taaacaccat 


gctaaacaca 


gtggggggac 


atcaagcagc 


catgcaaatg 


ttaaaagaga 


ccatcaatga 


ggaagctgca 


gaatgggata 


gagtacatcc 


agtgcatgca 


gggcctattg 


caccaggcca 


gatgagagaa 


ccaaggggaa 


gtgacatagc 


aggaactact 


agtacccttc 


aggaacaaat 


aggatggatg 


acaaataatc 


cacctatccc 


agtaggagaa 


atttataaaa 


gatggataat 


cctgggatta 


aataaaatag 


taagaatgta 


tagccctacc 


agcattctgg 


acataagaca 


aggaccaaaa 


gaacctttta 


gagactatgt 


agaccggttc 



tataaaactc 


taagagccga 


gcaagcttca 


caggaggtaa 


aaaattggat 


gacagaaacc 


960 




aaaatgcgaa 


cccagattgt 


aagactattt 


taaaagcatt 


gggaccagcg 


1020 


gctacactag 


aagaaatgat 


qacaqcatqt 


caqqqaqtaq 


qaggacccqg 

ZJ ZJ ZJ ZJ ZJ 


ccataaggca 


1080 


aoaqttttqq 


ctgaagcaat 


gagccaagta 


acaaatacag 


ctaccataat 


gatgcagaga 


1140 


ggcaatttta 


ggaaccaaag 


aaagatggtt 


aagtgtttca 


attgtggcaa 


agaagggcac 


1200 


acagccagaa 


attqcaqqqc 


ccctaggaaa 


aaqqqctgtt 

ZJ ZJ ZJ —i 


ggaaatgtgg 


aaaggaagga 


1260 


caccaaatga 


aagattgtac 


tgagagacag 


gctaatttct 


ttagggaaga 


tctggccttc 


1320 


ctacaaggga 


aaaccaqqqa 


attttcttca 


gagcagacca 


gagccaacag 


ccccaccatt 


1380 


tcttcaaaac 


agaccagagc 


caacagcccc 


accagaagag 


agcttcaggt 


ctqgggtaga 

ZD ZJ ZJ ZJ ZJ 


1440 


gacaacaact 


ccccctcaga 


aqcaqqaqcc 


gatagacaag 


gaactgtatc 


ctttaacttc 


1500 


cctcagatca 


ctctttggca 


acgacccctc 


gtcacaataa 


agataggggg 

ZJ ZJ ZJ ZJ ZJ ZJ 


gcaactaaag 


1560 


gaagctctat 


tagatacagg 


agcagatgat 


acagtattag 


aagaaatgag 


tttgccagga 


162 0 


agatggaaac 


caaaaatgat 


agggggaatt 


ggaggtttta 


tcaaagtaag 


acagtatgat 


1680 


cagatactca 


tagaaatctg 


tggacataaa 


gctataggta 


cagtattagt 


aggacctaca 


1740 


cctgtcaaca 


taattggaag 


aaatctgttg 


actcagattg 


gttgcacttt 


aaatttttaa 


1800 



<210> 59 

<211> 599 

<212> PRT 

<213> Artificial sequence 



Uri <220> 

H> <223> Gag-PI 



<400> 59 

Met Gly Ala Arg Ala Ser Val Leu Ser Gly Gly Glu Leu Asp Arg Trp 
15 10, 15 



Glu Lys lie Arg Leu Arg Pro Gly Gly Lys Lys Lys Tyr Lys Leu Lys 
20 25 30 



His He Val Trp Ala Ser Arg Glu Leu Glu Arg Phe Ala Val Asn Pro 
35 40 45 



Gly Leu Leu Glu Thr Ser Glu Gly Cys Arg Gin He Leu Gly Gin Leu 
50 55 " 60 



Gin Pro Ser Leu Gin Thr Gly Ser Glu Glu Leu Arg Ser Leu Tyr Asn 
65 70 75 80 



Thr Val Ala Thr Leu Tyr Cys Val His Gin Arg He Glu He Lys Asp 
85 90 95 



Thr Lys Glu Ala Leu Asp Lys He Glu Glu Glu Gin Asn Lys Ser Lys 
100 105 110 



Lys Lys Ala Gin Gin Ala Ala Ala Asp Thr Gly His Ser Ser Gin Val 
115 120 125 



Ser Gin Asn Tyr Pro He Val Gin Asn He Gin Gly Gin Met Val His 
130 ' 135 140 



Gin Ala He Ser Pro Arg Thr Leu Asn Ala Trp Val Lys Val Val Glu 
145 150 155 160 



Glu Lys Ala Phe Ser Pro Glu Val He Pro Met Phe Ser Ala Leu Ser 
165 170 175 



Glu Gly Ala Thr Pro Gin Asp Leu Asn Thr Met Leu Asn Thr Val Gly 
180 185 190 



Gly His Gin Ala Ala Met Gin Met Leu Lys Glu Thr He Asn Glu Glu 
195 200 205 



Ala Ala Glu Trp Asp Arg Val His Pro Val His Ala Gly Pro He Ala 
210 215 220 



Pro Gly Gin Met Arg Glu Pro Arg Gly Ser Asp He Ala Gly Thr Thr 
225 230 235 240 



Ser Thr Leu Gin Glu Gin He Gly Trp Met Thr Asn Asn Pro Pro He 
245 250 255 



Pro Val Gly Glu He Tyr Lys Arg Trp He He Leu Gly Leu Asn Lys 
260 265 270 



He Val Arg Met Tyr Ser Pro Thr Ser He Leu Asp He Arg Gin Gly 
275 ' 280 285 



Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg Phe Tyr Lys Thr Leu 
290 295 300 



Arg Ala Glu Gin Ala Ser Gin Glu Val Lys Asn Trp Met Thr Glu Thr 
305 310 315 320 



Leu Leu Val Gin Asn Ala Asn Pro Asp Cys Lys Thr He Leu Lys Ala 



325 



330 



335 



Leu Gly Pro Ala Ala Thr Leu Glu Glu Met Met Thr Ala Cys Gin Gly 
340 345 350 



Val Gly Gly Pro Gly His Lys Ala Arg Val Leu Ala Glu Ala Met Ser 
355 * 360 365 



Gin Val Thr Asn Thr Ala Thr lie Met Met Gin Arg Gly Asn Phe Arg 
370' 375 380 



Asn Gin Arg Lys Met Val Lys Cys Phe Asn Cys Gly Lys Glu Gly His 
385 390 395 400 



Thr Ala Arg Asn Cys Arg Ala Pro Arg Lys Lys Gly Cys Trp Lys Cys 
405 410 415 



Gly Lys Glu Gly His Gin Met Lys Asp Cys Thr Glu Arg Gin Ala Asn 
420 ~ 425 430 



Phe Phe Arg Glu Asp Leu Ala Phe Leu Gin Gly Lys Ala Arg Glu Phe 
435 440 445 



Ser Ser Glu Gin Thr Arg Ala Asn Ser Pro Thr lie Ser Ser Glu Gin 
450 455 460 



Thr Arg Ala Asn Ser Pro Thr Arg Arg Glu Leu Gin Val Trp Gly Arg 
465 470 475 480 



Asp Asn Asn Ser Pro Ser Glu Ala Gly Ala Asp Arg Gin Gly Thr Val 
485 490 495 



Ser Phe Asn Phe Pro Gin lie Thr Leu Trp Gin Arg Pro Leu Val Thr 
500 505 510 



lie Lys lie Gly Gly Gin Leu Lys Glu Ala Leu Leu Asp Thr Gly Ala 
515 ' 52 0 52 5 



Asp Asp Thr Val Leu Glu Glu Met Ser Leu Pro Gly Arg Trp Lys Pro 
530 535 540 



Lys Met He Gly Gly He Gly Gly Phe He Lys Val Arg Gin Tyr Asp 
545 550 555 560 



Gin He Leu He Glu He Cys Gly His Lys Ala He Gly Thr Val Leu 
565 570 575 



* 



Val Gly Pro Thr Pro Val Asn He He Gly Arg Asn Leu Leu Thr Gin 
580 585 590 



He Gly Cys Thr Leu Asn Phe 





595 


<210> 


60 


<211> 


51 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


PCR primer 


<400> 


60 



aaatcaaccg gaattgaatt ccctcgggtg taccagacct aacaacaata c 51 



Q 

jj i n 5 

6 
w 

W 

Ul attgttgggt ctcgtacaac aatgtgcttg tcttatatcc cc 42 

> - 

==3 



<210> 


61 


<211> 


42 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


PCR primer 


<400> 


61 



<210> 


62 


<211> 


41 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


PCR primer 


<400> 


62 



ggggatataa gacaagcaca ttgtacgaga cccaacaata c 41 



<210> 


63 


<211> 


39 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


PCR primer 


<400> 


63 



gttgtagggc cttgtgcaac aatgtgcttg tcttatatc 3 9 



<210> 64 
<211> 39 
<212> DNA 



<213> Artificial sequence 



<220> 

<223> PCR primer 
<400> 64 

gatataagac aagcacattg ttgcacaagg ccctacaac 



<210> 


65 


<211> 


36 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


PCR primer 


<400> 


65 



ggtggagggt ctggtacaac aatgtgctct tcttat 



<210> 66 

<211> 36 

<212> DNA 

<213> Artificial 



sequence 



<220> 

<223> PCR primer 



<400> 66 

ataagaagag cacattgttg taccagaccc tccacc 



<210> 67 

<211> 47 

<212> DNA 

<213> Artificial 



sequence 



<220> 

<223> PCR primer 



<400> 67 

gtattgttgt tgggtcttgt acaacaatat gcttttctta tatctcc 



<210> 68 

<211> 47 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> PCR primer 

<400> 68 

ggagatataa gaaaagcata ttgttgtaca agacccaaca acaatac 



<210> 69 

<211> 39 

<212> DNA 

<213> Artificial 



sequence 



o 
n 



•Vc? 
US 



<220> 

<223> PCR primer 
<400> 69 

gttattaggt ctggtacaac aatgtgcctt tctgatgtc 39 



<210> 


70 


<211> 


39 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


PCR primer 


<400> 


70 



gacatcagaa aggcacattg ttgtaccaga cctaataac 39 



<210> 


71 


<211> 


54 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


PCR primer 


<400> 


71 



aataaactag tctagacccc cgagtctaga acaatgtgct tgtcttatat ctcc . 54 



<210> 


72 


<211> 


7 


<212> 


DNA 


<213> 


Artificial sequence 


<220> 




<223> 


MMLV SD site 


<400> 


72 


aggtaag 


<210> 


73 


<211> 


9 


<212> 


DNA 


<213> 


Artificial sequence 


<220> 




<223> 


MMLV SA site 


<400> 


73 


ctgctgcag 


<210> 


74 


<211> 


90 


<212> 


DNA 


<213> 


Artificial sequence 



• 



<220> 

<223> DNA encoding gpl20 signal peptide 
<400> 74 

atgagagtga aggagaaata tcagcacttg tggagatggg ggtggagatg gggcaccatg 
ctccttggga tgttgatgat ctgtagtgct 



<210> 75 
<211> 129- 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> DNA encoding gp41 transmembrane domain 
<400> 75 

ttattcataa tgatagtagg aggcttggta ggtttaagaa tagtttttgc tgtactttct 
gtagtgaata gagttaggca gggatattca ccattatcgt ttcagaccca cctcccaatc 



ccgagggga 



